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Description 

This invention relates generally to human papilloma virus, and more particularly, relates to nucleotide sequences 
of short strands of human papilloma virus which can be amplified and/or used to determine the presence of human 
5 papilloma virus products in a test sample, and some ol which also can be amplified and/or used to determine the 
specific type of human papilloma virus of types 1 6 and 1 8 present in the tost sample 

Human papilloma virus (HPV) ) is recognized as a venereally-transmitted disease of the anogenital tract which 
often is associated with the pathogenesis of cervical cancer and its precursor lesions. More than 56 types of HPV have 
been characterized Of these, at least 21 types infect the anogenital tract L. Gregoire et al., J Clin, Micro 27 (12) 
W 2660- 2665( 1989), These mucosotropic viruses are associated most frequently with benign condyloma or latent in- 
fections. However, the presence of HPV in premalignant lesions and invasive cancers, particularly of the cervix, may 
reflect the oncogenic potential of these viruses. See P M. Howley, in Important Advances in Oncology , D. T. DeVita, 
Jr. et al., eds , J. B. Lippincott, Philadelphia, PA (1987) at pages 55-73. 

Certain HPV types, namely, HPV type 16 and type 18, and to a lesser extent HPV types 31, 33 and 35, are found 
is in a high proportion of invasive cervical cancers and their metastases. However, many HPV types which infect the 
anogenital tract, such as HPV types 6 and 11, are found most commonly in benign condyloma and only rarely are found 
in invasive cancers. HPV detected in the anogenital tract can be classified broadly as low risk papilloma viruses ( HPV 
types 6 and 11), intermediate risk papilloma viruses ( HPV types 31 , 33 and 35) or high risk papilloma viruses ( HPV 
types 16 and 18), based on the association of the particular HPV type with malignancy. A T. Lorincz et al., J Nat' I 
20 Cancer Inst. 79 671 (1987) Thus, the detection of the presence of HPV and the determination of the specific type of 
HPV can provide a diagnostic and prognostic tool useful for determining the clinical significance associated with certain 
HPV types The early detection of HPV by sensitive and specific reagents and methodologies also could provide earlier 
therapeutic management and counseling. 

A need therefore exists for accurate and reliable methods to identify and type HPV in clinical specimens. However, 
ss known polyclonal antisera prepared by immunizing animals with disrupted virionsare capable of detecting HPV antigens 
in only about 30-70% of cutaneous and mucosal warts. Further, the antisera are broadly cross-reactive. Available 
immunological tests have two major drawbacks. First, only well-differentiated cells apparantly are capable of viral 
antigen expression HPV-infected tissues which show higher degrees of neoplasia, such as carcinoma in situ , rarely 
contain HPV antigen. Thus, the further the development of the malignancy, the smaller the amount of detectable virus 
30 in the tested tissue. Secondly, these immunological tests are unable to identify specific viral types. 

It is known that papilloma viruses share amino acid sequences in the major capsid proteins. See, for example, C 
C. Baker, in The Papovaviridae (Vol 2), P. M. Howley and N P. Salzman, eds., Plenum Publ. Corp., New York (1987) 
at pages 321-385 The DNAs of this virus cross-hybridize, indicating homologous sequences. M F. Lawetal., J Virol. 
58 225-229 (1979). Thus, molecular hybridization techniques have been developed as a more sensitive and specific 
3S means of detecting and differentiating HPV DNA and RNA in clinical specimens. See A. T, Lorinez, Obstetrics and 
Gynecol. Clinics of N. America 14:451 ( 1987). 

Sequences specific for the DNA and RNA of human papilloma virus are known and have been published. See, for 
example, PCT application No. WO 89/69940 published October 1 9, 1 989, PCT application No. WO B6/0581 6 published 
October 9, 1986 and European Patent Application No. 0 301 968 published February 1, 1989 
40 The molecular hybridization techniques used to detect homologous DNA sequences are sensitive and can be 

highly specific if used with probes which bind to nucleic acid sequences which are unique to a particular HPV type 
However, the concentration of total viral DNA in a given clinical sample may be below the limit of sensitivity of the test. 
For example, the amount of viral DNA in dysplastic cervical lesions is reduced with increasing dysplasia. 

To overcome this problem of sensitivity, viral DNA sequences can be amplified by using, for example, the polymer- 
ts ase chain reaction (PCR) or the ligase chain reaction (LCR) techniques The products thus obtained can be identified 
by using conventional hybridization techniques for identification ol virus types, such as Souihem blotting. See C. Oste, 
Biotechniques 6:163( 1988), K B Mullis, U. S. Patent No 4.683,202, and EP-A-320 308 (BioTechnica). 

Both PCR and LCR serve to amplify the DNA present in a test sample to detectable levels In practice, the level 
of sensitivily is about 50 to 100 copies per sample. The next most sensitive technique is dot-blot, which can detect 
so about 10,000 molecules, while Southern blot reliably detects about 100,000 copios ol DNA per sample 

Thus, the appropriate diagnosis of HPV may require two steps. In one strategy, the presence of a clinically relevant 
type of HPV is first detected with a group-specific primer After the presence of HPV is detected, differentiation between 
types can be performed by using a type-specific probe having low homology between the HPVs of the group Alterna- 
tively, differentiation can be performed using a mixture of type-specific probes at the outset, provided these probes 
ss amplify DNA independently of each other, and that they can be detected independently. In the past, such tasks were 
attempted using spocific antibodies In general, nucleic acid probes and primers allow greater discrimination among 
subtypes than do antibodies. The use of DNA-based tests increases both sensitivity and specificity over prior-art an- 
tibody-based tests. 
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It therefore would be advantageous to provide oligonucleotide strands of DNA which could be amplified and used 
to detect the presence, if any, of HPV in a test sample It also would be advantageous to provide short oligonucleotide 
strands of DNA which could be amplified and used to detect the presence, if any, of specific types of HPV in the test 
sample. The combined use of oligonucleotide strands would be advantgeous for allowing for the specific and sensitive 
s in vitro diagnosis of the presence and specific type of HPV present in test samples 

SUMMARY OF THE INVENTION 

Oligonucleotides of from about 10 to about 60 nucleotides are provided which can be amplified and used either to 
io detect specific sequences of specific types of human papilloma virus, or consensus regions with high homology among 
different types. The presence of HPV is determined by contacting the test sample with sequences provided to detect 
the presence, if any, of HPV typos 6,11, 16, 1 6, 31 , 33 and 61 This may be done with or without prior amplification, 
for example, by PCR or LCR Either type-specific or consensus amplification is also possible Two oligonucleotides 
are provided if the sequence is to be amplified by PCR, and four oligonucleotides provided if amplification is by LCR, 
'5 in accordance with these known amplification procedures- After the presence of HPV is detected, the type of HPV 
present in the sample can be determined by using HPV type-specific probes, by subsequent rounds of PCR, or by 
LCR. Alternatively, the presence of type-specific HPV can be determined by contacting the test sample directly with 
type-specific nucleotide sequence provided by the invention for the detection of HPV types 16 and 18 Also provided 
are methods tor using the oligonucleotides and kits tor amplifying and detecting the presence of human papilloma virus 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a photograph of a gel following electrophoresis showing the results when the primers PCR 1 and PCR5 
were used to amplify selected plasm ids wherein HPV 6 is in lane 1, HPV 11 is in lane 2, HPV 16 is in lane 3, HPV 18 
ss is in lane 4, and HPV 31 is in lane 5, HPV 33 is in lane 6, HPV61 is in lane 7, and molecular weight standards are in lane 8. 

FIG. 2 is a photograph of a gel following electrophoresis showing the results when the primers PCR 1, PCR2, 
PCR3, PCR4 and PCR5 were used to amplify plasmid p65.16.8 (HPV 16). PCR1 and PCR5 are primers according to 
the invention. 

FIG. 3 is a photograph of the ethidium bromide-slained gels wherein PCR 1 4 and PCR15are used in conjunction 
30 with IWDO to obtain amplified PCR product. 

FIG. 4 is a graph of results obtained from performing LCR on 10 7 molecules of the selected targel using LCR5A, 
LCR5A', LCR5B and LCR5B'. Tho rate of reaction of 4-methyl lumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type 

FIG. 5 is a graph of results obtained from performing LCR on 10 7 molecules of the selected target using LCR6A, 
35 LCR6A', LCR6B and LCR6B' The rate of reaction of 4-methyllumbellilerone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 

FIG. 6 is a graph of results obtained from perfomring LCR on 10 7 molecules of the selected target using LCR7A, 
LCR7A', LCR7B and LCR7B'. The rate of reaction of 4-melhyllumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type 
40 FIG 7 is a graph of results obtained from perfomring LCR on 10 7 molecules of the selected target using LCRBA, 

LCR8A', LCR8B and LCR8B' The rate of reaction of 4-methyllumbelliferone is expressed as fluorescence counts/ 
second/second and plotted against the target HPV type. 

DETAILED DESCRIPTION OF THE INVENTION 

The appropriate diagnosis of HPV requires two sets of conditions. The first enables tho detection of all pertinent 
types, and the second set allows differentiation among them In the past, such tasks have been attempted using specific 
antibodies. In general, nucleic acid probes and primers allow greater discrimination among subtypes than do antibodies. 
Thus, the use of DNA-based tests tends to increase both sensitivity and specificity over antibody-based tests. 
so u. S. Patents No. 4,683,195 and 4,683,202 teach a method of amplifying DNA sequences by using PCR. This 

method now is a standard procedure in many molecualr biology laboratories Examples 1-3 which follow below utilize 
the procedures taught in these two patents and the method as described in the package insert of the commercially- 
available Gene-Amp ,M kit (Document No. 55635-6/89, Perkin-Elmer/Cetus, Emeryville, CA). 

In PCR, two complementary polynucleotide strands are amplified by treating the strands with two oligonucleotide 
primers such that an extension product of each primer is synthesized which is complementary to each nucleic acid 
strand. The primers are selected such that the extension product of one primer forms a template for the synthesis of 
an extension product from the other primer once the extension product of the one primer is separated from the template. 
A chain reaction is maintained by a cycle of denaturing the primer extension products from their templates, treating 
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Ihe single-stranded molecule genoraled with the same primers to re-anneal, and allowing the primers to form further 
extension products. The cycle is repeated for any many times as it takes to increase the target nucleic acid segments 
to a concentration where they can be detected 

The amplified target sequence can be detected by any of several known techniques: for example, by denaturing 
s the double-stranded products formed by PCR, and treating those products with one or more reporter probes which 
hybridize with the extension products. The reporter probe has a detectable label, and usually is added in excess. The 
unhybridized reporter probe, therefore, must be separated from the hybridized reporter probe by involving a separation 
step. In another method of detecting the extension products without reporter probe and a separation step, the extension 
products are detected by gels stained with ethidium bromide. The diagnosis can be confirmed by transferring the DNA 
to to nitrocellulose and probing with a probe specific to the HPV type suspected o1 being present in the sample. 

Alternately with PCR, one may take advantage of known restriction sites within the HPV DNA to demonstrate that 
the amplified DNA contains the expected sequence by examining the cleavage pattern(s) generated with one or more 
restriction endonucleases. Verifying the authenticity of the amplified sequence may be necessary for two reasons: (1 ) 
to ensure that sequences complementary to the amplifying primers are not fortuitously present in cellular DNA which 
is does not contain HPV DNA, and (2), to identify the type of HPV present in the sample. If the sequences chosen for 
amplification are conserved among HPV types, then the finding of an amplified product does not implicate a particular 
HPV type. It also should be possible to predict the size of the amplified product based on the binding positions of the 
two primers. Thus, when that product is found, one reasonably can be assured that HPV is present. However, two 
different types of HPV may give the same or different size products. Thus, hybridization should be used to confirm the 
so identity of the amplified sequence until confidence is built that the interpretation of the results is reliable. It should be 
pointed out that the PCR technique will identify only closely related, or type-specific sequences in the absence of highly 
homologous primers, since only a small portion of the genome is analyzed 

Another particularly useful detection technique is described in EP-A-357 011 In this method, a different reporter 
molecule, e.g. hapten, is attached to each primer. Following amplification, but before denaturation. duplexes can be 
2S detected by "capturing" one hapten (haptenl) with a soled phase coated with anti-hapten1 . The separated complex 
can bo detected with a conjugate of label and anti-hapten2. and label associated with tho solid phase can be measured 

The Ligase Chain Reaction (LCR) amplifies sections of DNA by copying the section of DNA, and copying the 
copies of that section of DNA, many times over. This method is described in European Patent Application No 0 320 
308 published June 14, 1989, which is incorporated herein by reference. In this procedure, two probes (for example. 
30 A and B) complementary to immediately adjacent regions of a target soquence are hybridized and ligated This ligatad 
probe then is denatured away from the target, after which it is hybridized with two additional probes (A' and B') of sense 
opposite to the initial probes A and B. The secondary probes are themselves then ligated. Subsequent cycles of de- 
naturation/hybridization/ligation create the formation of double-length probes of both sense (+) and anfisenss (-). 

In LCR, the nucleic acid of the sample is provided either as single stranded DNA or as double-stranded DNA which 
3S is denatured to separate the strands. Four probes are utilized: 1he first two probes (A and B) are the so-called primary 
probes, and the second two probes (A and B') are the so-called secondary probes. The first probe (A) is a single strand 
capable of hybridizing to a first segment of the primary strand of the target nucleotide sequence The second probe 
(b) is capable of hybridizing to a second segment of the primary strand of the target nucleotide sequence The 5' end 
of the first segment of the primary strand of the target is positioned relative to the 3' end ol the second segment of the 
to primary strand of the target to enable joining of the 3' end of the first probe to the 5' end of the second probe, when 
the probes are hybridized to the primary strand of the target nucleotide sequence. The third probe (A) is capable of 
hybridizing to the first probe, and the fourth probe (B') is capable of hybridizing to the second probe (B). The hybridized 
probes are ligated to form reorganized fused probe sequences Then, the DNA in the sample is denatured to separate 
ligated probes from sample DNA. Successive cycles wherein the ligated probes and target DNA undergo the above- 
*s described process are performed to increase the amount of detectable DNA in the sample. The amount of cycles 
performed is dependent upon the sequence used and the sensitivity required of tho test Usually, the cycle can be 
repeated from 15 to 60 times. At least one of the probes can be conjugated to a signal generating compound. 

If the four probes are conjugated to appropriate binding members, the detection of amplified product can be ac- 
complished using standard manual or automated immunoassay procedures known to those skilled in the art. These 
so procedures include, for example, immunochromatography, ELISA, EIA and MEIA. Hybridization also can be accom- 
plished by following standard dot-, slot- or replica-blot procedures which are known to those in the art. The sequences 
can be labelled with an appropriate signal generating compound (label), which is capable of generating a measureable 
signal detectable by external means. The various signal generating compounds contemplated include chromogens, 
catalysts such as enzymes, luminescent compounds such as fluoroscein and rhodamine, chemiluminescent com- 
55 pounds, radioactive elements such as 32 P, and other labels known to those of ordinary skill in the art. The selection of 
a particular label is not critical, but it will be capable of producing a a signal either by itself or in conjunction with one 
or more additional substances. A variety ot different indicator reagents can be formed of label and specific binding 
member. Either the label or a specific binding member can be varied. Examples of specific binding members which 
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can be jsed as a member of the indicator reagent include antibodies, both monoclonal, polyclonal, and fragments 
thereof; avidin or biotin, biolin and anti-biotin, a carbohydrate or a lectin, a complementary nucleotide sequence, an 
effector or a receptor molecule, an enzyme cofactor or an enzyme; an enzyme inhibitor or an enzyme; also any antigenic 
substances, haptens, antibodies, and combinations thereof. 
s The test sample can be any biological material suspected of containing HPV, Thus, the test sample can be human 

body tissue, or a test sample which contains cells suspected of containing HPV 

The invention will now be described by way ol Examples, which are meant lo describe, but not to limit, the spirit 
and scope of the invention. 

The following terms used in the examples are trademarks, tradenames or chemical abbreviations as specified: 

w 

TRIS - chemical abbreviation for (tris(hydroyxmethyl)ammomethane), used as a buffer. 

EDTA - chemical abbreviation for ethylenediaminetetraacetic acid, a chelating agent. 

FITC • chemical abbreviation for fluorescein isothiocyanate, a flourescent hapten derivative. 

NHS-ester - chemical abbreviation for N-hydroxysuccmamide ester 
>s MES - chemical abbreviation tor [2-|N-morpholino)ethanesulfonic acid], a buffer 

TWEEN®-20 - trademark of Atlas Chemical for polyoxyethylene sorbitan monolaurate, a detergent. 

BIS-TRIS - chemical abbbreviation tor [bis-(2-hydroxyethyl )-amino]1ris-(hydroxymethyl)methane, a buffer. 

TRITON X- 100® - trademark of Rohm & Haas for nonaethylene glycol oclylphenol ether, a detergent. 

IMx® - trademark of Abbott Laboratories for an automated instrument for perlorming microparticle enzyme immu- 
20 noassay (MEIA). 

EXAMPLES 

EXAMPLE 1 

25 

PCR was performed essentially following the package insert ol the commercially available Gene-Amp™ kit (doc- 
ument No 55635-6/89, available from Perkin-Elmer/Cetus. Emeryville, CA) The following reagents were mixed in a 
0.5 mL polypropylene tube and used in performing PCR 



Reagent 


Final Concentration 


Water 


(to give final volume = 50 or 100 u.L) 


Reaction Buffer 


10mMTRISpH8.3 




50 mM KC1 




1.5 mM MgC12 




0 01% gelatin 


dNTP mixture 


200 nM each of dATP.dCTP.dGTP, and TTP 


pCR1 


1 u.M 


pCR2 


1 u.M 


plasm id 


10 nL 1 ng/100uL 


(or control-human placental DNA (Pooled Placental DNA, catalog D-32B7, Sigma Chemical Co, St. Louis MO). 


DNA polymerase, 




Thermus Acquaticus 


25 or63.9 units/1 mL 



After mixing, the reaction mixture was overlayed with 100 uL of mineral oil. The tube then was placed in an instru- 
ment capable of incubation at several temperatures, and subjected to 30 or 40 cycles of programmed temperature 
change. The precise cycle of temperature change used, and the instrument used, varied with the experiment, and is 
detailed in the descriptions of the figures in Example 3. 



EXAMPLE 2 

Following the procedure of Example 1 . the following sequences were found to amplify sections of papilloma virus 
types 6, 11, 16, 18, 31 33, and 61 using PCR. 

PCR!: CAGATGTCTC TGT3GCGGCC TAGTG ( I D No. 1) 
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PCR5: AGGTGTCAGG AAAACCAAAT T TAT T (ID No. 5) 

PCR I 4: GAATTAGTTA GACCATTTAA AAG ( I D No 6) 

5 

PCR I 5 ; GGGGAAACAC CAGAATGGAT A (ID No. 7) 

IWDO: A T CAT AT GCC CACTGT ACCA T (ID No. 8) 

Sequence IWDO is derived 1rom a sequence disclosed in International application number PCT/US86/00629 (WO 
10 86/05816). 

TABLE 1 shows the sequences and where they map to to in the various types 



TABLE 1 

SEQUENCES WHICH CAN BE USED AS PROBES OR PCR PRIMERS 
iPRcet seq sequence sense haps m maps to-. iiaps to: maps to. maps to: maps to-. 

ID No. 

(typetn ttyptlb) (tvp»18) ( tvP" 3 1 ) Ctyp«33) 

PCftl: I CAGAroTCTCTQTOOCOGCCTAGTO . S766-S8I0 S768-S792 S634-S6S6 5610-5634 SS50-SS74 559 1 -561 S 

PCR2: 2 COTTTTCCATATTTTTTroCAOATO . S767-S79t S749-S773 615-5639 5591-5615 5531-S555 5572-5596 

OPCRJ: 3 AAGTTOTAAGCACCGATGAATArGT . SB44-Se6« 5826-58SO 69S-S7I9 5671-569S 5611-5635 56S2-S676 

PCR4: 4 AATGTACCCTAAATACCCTATaTTO - 6C0fl-S984 S990-S966 865-S841 S841-S6I7 5784-5760 5825-S8C1 

PCR5: S AGO t Q7CAGGAAAACCAAA T T TAT t - 6044-6020 6026-6OC2 S90I-5877 S877-S6S3 5820-5796 5661-5837 

PCft I 4: 6 GAAT T AOT T AGACCAT T TaaaaO • I49S-IS17 M9S-ISI7 IS24-IS46 IS9S-1617 1462-1484 I5I8-IS40 

PCRIS: 7 GGGGAAACACCaGaaTGGa TA • I834-I8S4 I834-18S4 1863-1583 1934-1954 ',801-1821 1857-1877 

5IWD0: 8 ATCATATGCCCACTGTACCAT - I93I-I9II 1931-1911 1960-1940 2031-2011 1898-1878 1954-1934 

note: PCR2, PCR3 and PCR4 are not probes or PCR primers of the invention 
EXAMPLE 3 

Linearized plasmids containing full-length papilloma virus inserts in pGEM3 were used as targets. These were 
pHPV6.1 ( HPV6), pSP65.11.5 ( HPV 11), p65 16,8 (HPV16), pHPV1BH(HPV18), pG3 HPV31 ( HPV31), 
pLNK322,HPV33 ( HPV33), and pBH322 HPV61 (HPV61) The Programmable Cyclic Reactor™ (available from Eri- 
comp, San Diego) was used as the incubation instrument. Following PCR procedures as described in Example 1,10 
uX aliquots were analyzed by electrophoresis through agarose (comprising a 5:3 ratio of NuSieve®:SeaKem® GTG, 
available from the FMC Corp., Rockland, ME) in a buffer comprising 0.089 M TRIS, 0.089 M borate, 2 mM EDTA, and 
0.5 ppt ethidium bromide. 

FIG 1 is a photograph of an ethidium bromide-stained 1 2% agarose gel showing results using 63 9 units/mL DNA 
polymerase, in the DNA Thermal Cycler™ (Perkin-Elmer/CETUS. Emeryville, CA). The samples were heated for 5 
minutes at 94°C, then subjected to 40 cycles of a temperature program of: 1 minute at 94°C, 2 minutes at 40°C, and 
1 .5 minutes at 72°C The PCR primers used in this case were PCR 1 and PCR5 of Example 2 Examination of the gel 
following electrophoresis showed bands at the expected oositions, i.e. 292 bp. Lane 1 , HPV6; lane 2, HPV 1 1 ; lane 3, 
HPV16 ; lane 4, HPV 18; lane 5, HPV31; lane 6, HPV33, lane 7, HPV61, lane 8, pooled human placental DNA ( sus- 
pectod of having HPV infection); lane 9, molecular weight markers Hae III digest of <J>X174 

FIG 2 is a photograph of an ethidium bromide -stained 4% agarose gel showing results using 25 units/mL DNA 
polymerase, in the Programmable Cycler Reactor™ ( Ericomp, San Diego, CA). Samples in this case were subjected 
to 30 cycles of a temperature program of: 50°C for one (1) minute, 72°C for two (2) minutes and 95°C for one (1) ) 
minute In this case, the primers PCR1 , PCR2, PCR3, PCR4 and PCR5 of Example 2 were used to amplify plasmid 
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p65,16,8(HPV 16). Examination of the gel ol Figure 2 shows bands at the expected positions, i.e., PCR 1 and PCR4. 
235 bp, lane 2; PCR1 and PCR5, 267 bp, lane 4; PCR2 and PCR4, 254 bp, lane 6; PCR2 and PCR5, 286 bp, lane 8; 
PCR3and PCR4, 174 bp, lane 10; PCR3and PCR5, 206 bp, lane 12; molecular weight marker, 123, 246, 369, 492,. 
bp ladder, lane 1 . Note footnote to Table 1 

s FIG. 3 is a photograph of an ethidium bromide-stained 1 2% agarose gel showing results using the same conditions 

as FIG. 1 In this case, PCR 14 and PCR15 were used as primers in conjunction with IWDO. The expected size of the 
amplified PCR product of PCR 14 and IWDO is 437 bp for all of the HPV types tested The expected size of the product 
of PCR 15 and IWDO is 98 bp. Products of these sizes appear in the gols, confirming that PCR14 and PCR15, used 
in conjunction with IWDO, will amplify HPV DNA of typos 6, 11. 16, 18, 31 , 33, and 61. Lane 1, Molecular weight 

10 marker (Hae III digest of FX 174): PCR 14 + IWDO. lanes 2-9 lane 2, HPV6 , lane 3, HPV 11 . lane 4, HPV16 ; lane 
5, HPV18 ; lane 6, HPV31 ; lane 7, HPV33; lane 6, HPV61 ; lane 9. human placental DNA suspected of being infected 
with HPV; PCR 5 + IWDO, lanes 10-17: lane 10, HPV6; lone 11 , HPV 11; lane 12. HPV16; lane 13, HPV18 ; lone 14. 
HPV31 ; lane 15, HPV33 ; lane 16, HPV61 ; lane 17, human placental DNA suspected of being infected with HPV: lane 
18, molecular weight marker ( Hae III digest of FX174 and HinD III digest of 1 DNA) 

ts 

EXAMPLE 4 



The following reagents were mixed in a 0.5 mL polypropylene tube as follows for the Ligase Chain Reaction (LCR): 



Reagent 


Volume 


Final Concentration 


Water 


21 uL 




Reaction Buffer 


10uL 


50 mMEPPS pH7.6 






10 mM NH 4 CI 






10 mM MgCI 2 






100 mM K + (from all sources) 






0.001% BSA 






1 mM DDT 


Nicotine Adenine Dinucleotide (NAD) 


0.5 uL 


100 uL 


ProbeA (sense) 


4 uL 


5.0 x 10 11 molecules 


ProbeA' (antisense, 5'-phosphate) 


4u.L 


7.5 x 10" molecules 


ProbeB (sense, 5'-phosphate) 


4uL 


7 5 x 10" molecules 


Probe B' (antisense) 


4 u.L 


5 Ox 10" molecules 


Target (including human placental carrier DNA at 10 u.g/mL) 


1.5 uL 


16 ng/50 u.L 


DNA ligase, Thermus therpophilus 


1 u.L 





This reaction mixture was overlayed with 30 u.1 of mineral oil The tube was placed in an instrument capable of 
incubation at several temperatures ( e.g. thermal cycler (rom Coy Laboratory Products (Ann Arbor, Ml) or the Program- 
mable Cycler Reactor™ (available from Ericomp, San Diego, CA), and then subjected to several cycles of programmed 
temperature change Each cycle invofved incubation at 50°C for one minute and 85°C for one minute. 



EXAMPLE 5 

The following procedure was used when performing the Ligase Chain Reaction ( LCR), which is described in 
published European Patent Application No. 0 320 308 A2. The reagents of Example 4 were utilized in the procedure 
as follows: Two probes (A and B) complementary to immediately adjacent to regions of a target sequence were hy- 
bridized and ligated. This ligated probe was denatured away from the target, and hybridized with two additional probes 
(A' and B') of sense opposite to the initial probes (A and B) The secondary probes then were ligated Subsequent 
cycles of denaturation/hybridization/ligation created the formation of double-length probes of both + and - sense. 

EXAMPLE 8 

Ths following sequences were determined to be specific for a portion of the E6 region of HPV type 16: 
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Probe 


SFOIDNo. 


Sequence 






Mans to: 


LCR5A 


81 


GCTGCAAACA 


ACT ATACATG 


ATATAA 


157 - 182 


ICR5/V 


82 


pTTATATCATG 


TATAGTTGTT 


TGCAGC 


182 - 157 


LCR5B 


83 


p TAT TAGAATG 


TGTGTACTGC 


AAGCA 


183 - 208 


LCR5B' 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA 


208 - 157 



EXAMPLE 9 

Base-denatured plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets 
These plasm ids were pG3HPV6(+) (HPV6), pSP 65. 11.5 (HPV11), pSP65.168 (HPV16), p63HPV18H(-)(HPV18), 

W P 63:HPV31 (HPV31 ), pLNK322:HPV33 (HPV33), pBR322:HPV35 (HPV35), pUC19.HPV52 (HPV52), pLNK322: 
HPV58 (HPV58), pUC9 HPV59 (HPV59) and PBH322 HPV61 (HPV61) All of the oligonucleotides used as probes 
trom Example 8 had chemical labels covalently attched at the ends distal from ligation. These labels were: 5'-fluores- 
cein-LCRSA, 3'-fluorescein-LCRSA', 3'- biotin-LCR5B and 5'-biotin-LCR5B\ Covalent attachment was performed by 
known methods, i.e., reaction of amine-terminated oligonucleotides with FITC orbiotin-NHS-ester essentially following 

20 the procedures of Kansal et al , Tet. Letters 29 5537-5540 (1933) The thermal cycler used was obtained from Coy 
Laboratory Products, Ann Arbor, Ml. 

Following the LCR procedure of Examples 4 and 5, the mix'.ures were analyzed using a prototype version of the 
IM X ® instrument (Abbott Laboratories, Abbott Park, IL), following the protocol for microparticle enzyme immunoassays 
as follows A 40pL aliquot of an LCR mixture was diluted 11 with distilled water This diluted mixture was incubated 

25 with 50u.L antifluorescoin-conjugated polystyrene microparticles for five (5) minutes to form a suspension of immune 
complexes on the microparticles. This suspension then was transferred to an inert glass fiber matrix, to which the 
microparticles became attached The matrix was washed with buffer (0.3M Nad, 10 mM TRIS pH8, 0,1%NaN 3 ), Any 
immune complexes attached to the glass matrix was detected by using alkaline phosphatase-labeled conjugate that 
catalyzed the hydrolysis of 4-methylumbelliferone. The rate at which the 4-methylumbelliferone was generated on the 

3£> matrix was proportional to the concentration of LCR product formed in the reaclion mixture. 

Referring to FIG. 4, the graph shows the results obtained from performing LCR on 10 7 molecules of the targets in 
shown. The rate shown is the rate of generation of 4-methylumbelliferone, and is expresssed as fluoresconce counts/ 
second/second Background signal is approximately 10 c/s/s, as shown by the amplification of human placental DNA. 
The only values above background are those for sample containing HPV16, and those values are about 60 times 

as background signal 

EXAMPLE 10 



The following sequences were determined to be specific for a portion of the E6 region of HPV type 1 8: 



Prprje 


SEP. ID No. 


Sequence 






flaos to; 


LCR6A 


85 


CT TCACTGCA 


AGACAT ACAA 


AT AA 


I 72 - I 95 


LCR6A' 


86 


pTTATTTCTAT 


GTCTTGCAGT 


GAA 


I 95 - 173 


LCR6B 


87 


pCCTGTGTATA 


TTGCAAGACA 


GT AT 


196-219 


LCR6B' 


88 


TACTGTCTTG 


CAATATACAC 


AGG 


218 - 196 



EXAMPLE 11 

50 

Plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets. The plasmids used 
were those described in Example 9 All of the oligonucleotides used as probes oblained from Example 1 0 had chemical 
labels covalently attached at the ends distal from ligation. The thermal cycler was obtained from Coy Laboratory Prod- 
ucts, Ann Arbor, Ml. 

ss Following LCR procedure described in Examples 4 and 5, the mixtures were analyzed as described in Example 9 

using the prototype version of the IM„® instrument (Abbott Laboratories, Abbott Park, IL). 

Referring to FIG. 5, the graph dislays the results obtained from performing LCR on 10 7 molecules of the targets. 
The rate shown is the rate of generation of 4-methylumbelliferone, and is expressed as fluorescence counts/second/ 
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second. Background signal is approximately 1 5 c/s/s, as shown by the amplification ot human placental DNA. The only 
values above background are those for sample containing HPV 8, and those values are about 40 times background 
signal 

5 EXAMPLE 12 

The following sequences were determined to be specific for a portion of the E6 region of HPV type 18. 



Probe 


SFO ID No. 


Sequence 






Maps to: 


ICR7A 


89 


TAT AT TGCAA 


GACAGTATTG 


GAAC 


200 - 223 


LCR7A' 


90 


pOTTCCAAT AC 


TGTCTTGCAA 


TTTA 


223 - 200 


LCR7B 


91 


pT TACAGAG6T 


ATTTGAATTT 


GCATT 


224 - 249 


LCR7B' 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA 


249 - 224 



EXAMPLE 1 3 

so Plasmids which contained full-length papilloma virus inserts in pGEM3 were used as targets The plasmids were 

those of Example 9 All of the oligonucleotides from Example 12 which were used as probes had chemical labels 
covalentfy attached at the ends distal from ligation. The thermal cycler was as described in Example 11. 

Following the LCR procedure of Examples 4 and 5, the mixtures were analyzed as described in Example 9 using 
the prototype version of the IMx instrument (Abbott Laboratories, Abbott Park, IL). 

2S Referring to FIG. 6 , the graph shows the results obtained from performing LCR on 10 7 molecules of the targets. 

The rate shown is the rate of generation of 4-mothylumbelliferone, and is expressed as fluorescence counts/second/ 
second. Background signal is approximately 15c/s/s as shown by the amplification of human placental DNA. The only 
values above background are those for sample containing HPV 18, and those values are about 80 times background 
signal. 

EXAMPLE 14 

The following sequences were determined to be specific for a portion of the E6 region of HPV type 16 



Probe 


SFQ ID No. 


Sequence 






Maps to 


ICR8A 


93 


GTATGGAACA 


ACAT TAGAAC 


AGCA 


352 - 


375 


LCR8A' 


94 


pTGCTGTTCTA 


ATGTTGTTCC 


AT AC 


375 


352 


LCR8B 


95 


pATACAACAAA 


CCGTTGTGTG 


AT T T 


376 


399 


LCR8B' 


96 


AAAT CACACA 


ACGGTTTGTT 


GTAT 


399 


376 



4S EXAMPLE 15 

Plasmids which contained lull-length papilloma vi'us inserts in pGEM3 wore used as targets. All of the oligonucle- 
otides from Example 14 used as probes had chemical labeis covalentiy attached at the ends distal from ligation The 
thermal cycler was as described in Example 11 

SO Following LCR procedure of Examples 4 and 5, the mixtureswere analyzed as described in Example 9 using the 

prototype version of the IM X ® instrument (Abbott Laboratories, Abbott Park, IL). 

Referring to FIG. 7 , the graph details the results obtained from performing LCR on 10 7 molecules of the targets. 
The rate shown is the rate of generation ol 4-mothylumbelliferone, and is expressed as fluorescence counts/second/ 
second. Background signal is approximately 10 c/s/s, as shown by Ihe amplification ot human placental DNA. The only 

ss values above background are those for sample containing HPV 16, and those values are about 36 times background 
signal. 
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EXAMPLE 16 

The attached Appendix (example 16) discloses the sequences of the invention aligned to known sequences. 
s EXAMPLE 16 
APPENDIX 

HUMAN PAPILLOMA VIRUS 

ALIGNMENT ot TYPES 6 11,16.18,31 and 33, with CONSENSUS SEQUENCE 

The appendix lists the sequences of HPV types 6, 11, 16, 16, 31, and 33 It also shows where the sequences of 
this invention line up with respect to these HPV sequences. In addition, the appendix shows where other sequences. 
>s known to the Inventors as of 28 September 1990, and claimed or disclosed by or unknown to others, line up with 
respect to these sequences. 

1. Sequences and Regions Claimed by Us, 

20 PCR = Sequences per examples 1 through 3 (only PCR1, PCR5 PCR14 and PCR15) 

LCR = Sequences per examples 4 through 1 4 only 

2. Sequences and Regions Unknown to Others and Not Claimed by Us, 

ss 

PCR = Sequences designated PCR other than those above JJ 

LCR = Sequences designated LCR other than those above 

30 3 Sequences and Regions Claimed by Others; 

(Italics represents anlisense sequences) 

AUS - International application number (Australians) PCT/AU88/00047 (WO 88/06634) 

35 WL = International application number (Wayne Lancaster, Wayne State University) PCT/US86/00629 

(WO 86/05816) 

BE ^ European Patent Application (Belgians) 89.033834 (X= T or U) 
40 c= International application number (CETUS) PCT/US89/03747 (WO 90/C2821) 

O = International application number (Oncor) PCT/US89/01318 (WO 89/09940) 
and 

4. Sequences and Regions Disclosed by Others. 

S = Sarkar, F.H, and Crissman. J.D. Biotechniques 9 180-184 (1990) (Italics represents antisense sequences) 



ss 
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gTTAATAACAATCTTgCTTTAA AAAAtAGGAGCG 

iiiiiiiiiiiiii unit mi mini 

cTTAATAACAATCTTAGTTTAA AAAAgAGGAGGG 

I II I I Mill I 



ACCGAAA ACGGTTCAACCGAAAA 
I I I I I I I I II I I I I I I I I I I I I I 
ACCGAAA ACGGTTCAACCGAAAA 

mini minimum 

gtaaACTAT.\ATgCcaAGTTTTA AAAA AG tAGGGtGTAACCG AAA gCGGTTCAACCGAAAA 

n inn i i n in mi mnmn nm mmii 

actACAATAAT tcAtGTATA AAA ctaAGGGcGTAACCGAAA tCGGTTGAACCGAAAc 



i mm 

TAATA ATAATAAT 
III I 



mi ii mini i iiininn 

CtTAGTATA AAA AAgtAGGGAGTgACCGAAA GtGgTGAACCGAAAA 

"""* 'ii n mini mini i i nnnn 



atTAATActTttaAcaattgTAGTATAtAAA AA AGGGAGTaACCGAAAacGgtcgGgACCGAAAA. 

--taatata-ta-aa-tcttag-T-tA-AAAaaag-ACGGagtaACCGAAA-acggtt-aACCGAAAa 

C4-CCCAASTTGGCTTTT 
CS-GCCAGCCCTGGCTTTT 
C3 6-CGGTTSAACCGAAAA 
C37-CGCTCCGGACCGAAAA 
C3 8-CGGTTSAACCGAAAX 
C3 9 -CGGTTCAACCGAAAM 
015-ATTAATACTTTTAACAATTGTAGTATATAAA AA AGGGAGTAACCGAAAACGGTCGGGACCGAAAA-015 
024- ACTACAATAAT TCATGTATA AAA CTAAGGGCGTAACCGAAA TCCGTTGAACCGAAAC-02 4 

S 1 -CGGTCGGGACCGAAAA 
S3-ACCGAAAC 

58 CGGTTgTATATAAA CCACCCCtAAAAtTTACCAaACGACGCATTATGGAAAGTgcAaATCCCTCCAC 

nm ilium mini mi mm mmmminiin i nniimi 

58 CGGTTafATATAAA CCAGCCCAAAAAaTTAGCAGACGAGGCATTATGGAAAGTaAAGATGCCTCCAC 

llll II II II II III I I (III I 

62 CGGTgcaTATAtAAAGCA aACATTTTgcagtaAGgtActGCACgACtATGTTTCAAGACaCtgAGGA 

iiii mi mm mini i i n i nnnn m i nn 

S8 CGGTTaGTATA AAACCA gACATTTTaTG c aCc AAaAgAG AAC t GC a ATGTTTCCa cAGG A 

nm nm mm i n n i n i i n inn i mi ii 

60 CGGTTgGTATATAAAGCAcataGTaTTtTGtgCaAAccTACAgacGCcATGTTcaAaaATCCtgCAGA 

mi miimi i i i iii ii n i mi n 

66 CGGT GTATATAAA agafcgtGagaaacacaCcAcaaTACtatgCCgcgcTTtgAggATCCaaCAcg 

CGGTt-gtatatAAagcag — ca-a~at — gcaaaca-agcatt-cgatgttt-aagAtcC — c-ga 
GCC-C4 A0S1-ATGCCTCCAC 
GCC-C5 

CGG-C36 C67-AAATCCTGCAGA 
CGG-C3 7 C6 8 -CCTACAGACGCCATGTTCA-C6 8 

CCG-C38 C71-GCAGTAACGTACTGCAC-C71 

CGG-C39 O10-CCATCCAACACG 
015-CGGT GTATATAAA AGATGTGAGAAACACACCACAATACTATGGCGCGCTTTGAGGATCC-015 
02 4 -CGGTTAGTATA AAAGCA GACATTTTATGCACCAAAAGAGAACTGCAATGTTTCCACAGCA ( -Q2 4 ) 

CGGTG- SI S 2 - CCGCCCGAAACTCCTAGGTTGTGC- S 2 

CGGTTAGTATA AAAGC-S3 
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125 OTCTGCAACg*CeXTAGACCAGTTGTGCAAGACSTTTAATCTaTCT»TGCAtACgtTGCAAATTaAtT 

iiimm i uiiuiiiiiiiiiiiiiiiiMiiiii iii mi it muni i i 

125 GTCTGCAACAtCtXTAGACCAGTTGTGCAAGACCrrTAATCTtTCTtTCCACACCcTGCAAATTCAST 

it ii i i i mm i i it i it i mi i n i mi 

129 »aaACCAcGAACa'JTgCAtgAtTIGTGCcAAGC*rrCgAgACAACTATACACAAcATtgAAcTACAGT 

m m ii i mi in ii ii i immtm i n i n i i 

124 gcGACCeaGAAAgTTaCcacAgKATGCaCAGagcTGcAAACAACTATACATGAtATAAtATTAgAaT 
llll I II II I I II II I I II III ! I llll III III I I 

128 aaGACCtcGgAAaTTgCaTGAaCTAaGCtCGGcAtTGgAAACAcCctacgATGAacTAAgATTgAAtT 
llll II I I III II II III I II I I I III II III I 

131 gcGACCctacAAgcTaCcfGAtCTgtGCaCGGaAcTGaAcActtCactgcAaGAcaTAgaAaTnAccT 

• g-gaccaagaa — tTacat-AgtTgtGCa-gge-tTgaA-»-atCtatgcAt-a-aTa-aAaTaaa-T 
GTCTGCAAC-AUS1 A0S7-GCAAGACGTTTAATCT-AUS7 

AAGACCTC-C67 C7 4 - ACACTCTGCAAATTCAGT 

010 -GCGACCCTACAAGCTACCTGATCTGTGCACGGAACTGAACACTTCACTGCAAGACATAGAAATAACCT-0 1 0 
024-GCGACCCACAAAGTTACCACAGlTATGCACAGAGCTGCAAACAACTATACATGATATAATATTAGAAr-O2 4 

54- CTGGGTCTTTCAATGGrGrCAAM-S4 

19 3 GtGTGTTTTGCAaGAATGCACTGACCACaGCAGAGATtTATtCATATGCaTATAAacACCTAAAGGTc 

i niimm imiimmm mum m limn mn minim 

193 GCGTGTTTTGCAgGAATGCACTGACCACoGCAGAGATATA'rGCATATGCcTATAAgaACCTAAAGGTT 

inn iiii ii i ii i in mm i m i i i m in 

197 GCGTGgAaTGCAAaAAACCtTTGCaaCGAtCTGAGGTATATGAt.TTTGCaTTTgcaGATTTAAcaGTT 

I m i mn iii ii i in iimimin mn in mm i 

192 GTGTGTACTGCAAgcAACAGTTACtgCGAcgTGAGGTATATGAcTrrGCtTrrcggGATTTAtgcATA 

iiii miiiii mm i mini n mn m mm in 

196 GTGTcTACTCCAAaggtCAGTTAacAgaAACACAGGTATTaGAtTTTGCATTTAcAGATTTAaceATA 

im ii mn n i imimiii n inmmi mini i 

199 CrGTaTAtTGCAAgaCagtaTTggaActtACAGAGGTATTtGAaTTTGCATTTAaAGATTTAtttgTg 

GtGTgtatTGCAagaa — catTgacac-a-caGAGgTaTatgaaTtTGCaTtTaoagAttTAa — gT- 
AUS2 - TACGTGACTGGrGGCCGTCTC-A0S2 C73-ACACCTAAACGTC 
GC-C74 AOS3-TGAGGTATATGACTTTGCTTTT-AUS3 

C60-GAGGTATWTGAHTTTGC-C6D 01-CTAAAGGTT 
C6 1 -GAGATWT ATKCATATGC-C 6 1 02 -CTAAAGGTT 
C6 9-ACAGTATTGGAACTTACAG-C69 04 -GATrTCCA* 

C70-CAACAGTTACTGCGACG-C7Q 06-TTATGCATA 
C 7 2 -G ACAGTATTGG AACTTACAG-C 7 0 07 -TTATGCATA 

55- GTGTTTTCCAGGAATGCACTGACCA-S5 08-AATACCTAT 

0 1 0-GTGTATArrGCAAGACAGTATTCGAACTTACAGAGGTATTTGAATTTGCATTTAAAGAirrATTTGTG-0 1 0 

01 1- TTATTTGTG 

012- TTATXTGTG 
013 - AATAAACAC 

017- CTAAAGGTC 

018- CTAAAGGTC 
020-6ATTTCCAC 

024-GTGTGTACTCCAAGCAACAGTTACT5CGACCTCAGGTATAXCACTTTCCrTTTCCCGATTTATGCATA-02') 

025- 



4S 



SO 



ss 



12 



EP 0 477 972 Bl 



261 cTGTttCGACgCggCTaTCCaTaTCCAGCcrGcGCgTGcTGCcTAGAAtTtCAtGGaAAAATaAACCA 

iii mi i ii iii i mill it ii ii in inn i ii ii urn dim 

261 GTGTggCGAGACaaCTtTCCcTTTGCAGCgTGTGCcTGtTGCTTAGAAcTgCAAGGgAAAATTAACCA 

ii i mi iii mi i m ii ii ii 1 1 ii i m i 

265 GTATATAGAGAgGGaAATCCATTTCcAaTATGTaAactgTCTTTgcgGTTcTtATCTAAAATTAGTGA 

minimi n mini n inn i inn in i iimniiimi 

260 GTATATAGAGAtGGgAATCCATATGctGTATGrgAtAAATGT-rTAAAGTTTTATTCTAAAATTAGTGA 

i mi n i ii i i iii i i ii in niinmi iiiiini in i inn 

264 GTATATAGgGACGacAcACCACAcGgaGTgTGTacaAAATGTTTAAqaTTTTATrCaAAAgTaAGTGA 

ii inn iii i m ii i i ii linn n iiiimi i i i n n 

267 GTgTATAGaGACagfcAtACCeCAtGetGcaTGecatAAATGTaTAgat/rrTTArrCtJVgAaTtAGaGA 

gT-TataGaGacggcaatCCatatCcag-aTGtg~-aaaTGttTagaatTt.i:attctAaAaTtAgtgA 
C-4 4 CTCTCYCCWWAGGTAWACGH-CU JJl-aattagnga 
C-iSCTCTGTCATATCGCGTACGA-C45 AUSB-GTGA 
C- 4 6 CCCTCCTGTGTGGTGTGCCT-C46 S6-GT 
C-47CYCTGCYGWWWGGTAWACSW-C41 
C-48CYCTGYYGUVAGGTAUACGU-C48 
C-4 9 CYCTGYYGUDWGGTAUACSff-CO 

CSS -HGAGACRGCWWTCCATWTG-C5 6 

C 57 -MGAG ACRGSWVrrCCATVfTG-CS 7 

C 5 8 -MGAG ACRG VWWTCC ATWTG -C 5 8 

C59-AGAGACAGTATACCGCATG-CS9 
GTGTGGCGAGACAACTTTCCCTrrGCAGCGTGTGCCTGTTG-Ol 

03-CAACTTTCCCTTTGCAGCGTGTGCCTGTTG-03 
CACACCGCTCTCTTCAAACGGAAACCrCGCACACCGACAAC-04 
GTATATAGACATGGGAATCCA-06 

GTATATAGAGATGGGAATCCATATGCTGTATGTGATAAATG-07 
CATA TA TCTCTACCCTTAGGTA TACGA CA TACACTA TTTA C- 0 3 
O9-ACCCTTAGGTATACGACATACACTATrTAC-09 
010 -CTGTATAGAGACAGTATACCCCATCCTGCATGCC ATAAATG TATAGATTTTTAT7CTAG AATTAGAGA-0 1 0 
GTGTATAGAGACAGTATACCG-Ol 1 

GTGTATAGAGACAGTATACCCCATGCTGCATGCCATAAATG-01 2 
CACATATCTCTGTCATATGGGGTACGACGrACGGrArrTAC-OL3 
014-GrCAWrCGCGTACCACGTACGCTArrrAC-014 

017 -CTGTTTCGAGGCGGCT ATCC A-Ol 7 

018 -CTGTTTCGAGGCGGCTATCCATATGCACCCTGCGCGTGCTG- 018 

0 1 9-GCCGATAGGTA TACGTCGGACGCGCACGAC- 019 
GACAAAGCTCCGCCCATAGGTATACGTCCCACGCCCACGAC-020 
024-GTATATAGAGATGGGAATCCATATGCTGTATGTGATAAATGTTTAAAGTTTTATTCTAAAATTAGTGA-024 
GTGTATACAGACAGTATACCG-02S 

02S-CAGTATACCCCATGCTGCATGCCATAAATG-026 



13 



EP 0 477 972 B1 



329 ATATAGACACTTTgArTATGCTCgATATGCAaCaACAGTtGAACAAGAJUlCtAAacAAGAcATcTTAg 

iim ii i ii i ii i iiiiiiiii Minn i urn mmum u uu it m 

329 ATATAGACACTTTAATTATCCTCcATATGCAt^TACAGTAGAACAAGAAACCAAtgAAGATATtTTAa 

iimmi i mini m inn i mi inn u i n n i i 

333 ATATAGACATTATAATTATtCTGr ATATGG AAST ACATT AGAACAAacAgttAAaAAACCTTTaa aTG 

imiiiiiiii mi i i iiiiiii iimiiiiu I II llltl II II 

328 gTATAGACArrATtGTTATAGItTCTATGGAACAACATTAGAACAgcaAtacAACAAACCgTTgTGrG 

i mi m i mm miimnmmmi i mm i mi 

332 ATTTAGAtggTATaGATATAGTGrGTATGCAACAACATTAGAAAAAfcTgACaAACAAAGGtaTATGTG 

iii m m i i iimmii mil mm i n mi n m 

335 ATTaAGAcatTATfccAgACtcTGTGTATGGAgacACATTgGAAAAACTaACtAACACtGGgtTATaca 

aTatAGAcatTaTaattAt-cTgt-TATGgAacaACAtTaGAA-Aa-aaactAAcaaag-t-Tat-tg 

atatagacatt-JJl 

GTATAGACATTAT-AUS8 

CS0—ATAHSACAYATACSTTGWTGTMATCTT-C5Q 
C51- ATAHSACAYATACSTTGHTGTMATC—CS1 
CS2-ATAHSACAYATACSTTGHTGTMAT-CS2 
CS3-CTGAGACACATACCTCTGTGTGTAACC-C53 
C5 4-CTGAGACACATACCTCTGTGTGTAA-CS 4 
C5 5-CTGAGACACA TACCTCTGTGTGTA-CSS 
O10-ATTAAGACATTATTCAGACTCTGTGTATGGACACACATTGGAAAAACTAACTAACACTGGGTTATACA-O10 
024 -GTATAGACATTATTGTTATAGTTTGTATGGAACAACATTAGAACAGCAATACAACAAACCGTTGTGTG-02 4 
TATATCTGTGAAATTAATACGAC-S6 

39 7 AcGTGcTAATTCGgTGcTACCTGTGTCACAAaCCGcTGTGTGAAgTAGAAAA ggTAAAaCAtATAcT 

i iii mini ii mimmim m miiiii mini mi n m i 

397 AAGTGTTAArrCGtTGTTACCTGTGTCACAAgCCGTTGTGTGAAaTAGAAAAA CTAAAgCAcATAtT 

ii i mm i m i urn i n mm mini i i 

401 AAaTaTTAATTAGGTGTATTATaTGTCAAAgaCCtTTGTGTCCTcAAGAAAAAaAAcGACATgTGGAT 

i i miiiimmm mini n mum iimmi ii nm mi 

396 ATTTGTTAATTAGGTGTATTAacTGTCAAAagCCacTGTGTCCTGAAGAAAAgCAAAGACATcTGGAc 

immmumim i iiiiiii u mini mum mmm mi 

4 0 0 ATTTGTTAATTAGGTGTATaAcGTGTCAAAgACCGTTGTGTCCAGAAGAAAAACAAAGACATtTGGAT 

mi nm nm i iii ii i iiiiiii nm mum nm i u 

40 3 ATTTaTrAATaAGGTGccTqcgGTGcCAgAaACCGTTGaaTCCAGcAGAAAAACttAGACAccTtaAT 

AttTgtTAATtaGgTGtat-- tgTGtCAaAaaCCgtTGtgTccagaAGAAAAaca-agAcatet-- at 
AUS4 -AATTAATCCACATAAT-AUS 4 AUS9-GATTTATTTG 
AUS5-TGTCATAACCTTCAATG1CT-AUS5 
010-ATTTATTAATAAGGTGCCTGCGGTGCCAGAAACCGTTGAATCCAGCAGAAAAACriAGACACCTTAAT-OlO 
024-ATTTGTTAATTAGGTGTATTAACTGTCAAAAGCCACrGTGTCCTGAAGAAAACCAAAGACATCTGGAC-O24 



45 



SS 
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464 aaccAAGGCgCGgTTCATAAA gCTAAATtgtacGTGGAAGGG TCGcTG 

inn ii mum nun miiim m n 

4 64 gggAAAGGCaCGcTTCATAAAA CTAAATaaCcaGTGGAAGGG TCGTTG 

II lllllll I I I II Mil Mil 

469 1 1 AAAc aAAcG AT^TC ATAAT AT TtcCGGTCGtTGGGCAGGGCGcTGTgcCgCgTGTTG 

iii ii mi mum mm m i u n m i i urn 

464 AAAAAGcAAaGATTCCATAATATA aGGGGTCGGTGG ACcGG tCGaTGTATG tCtTGTTG 

mm u mini u m i ii miiim- u n u u i urn 

4 68 AAAAAGaAACGATTCCACAACATAG GaGGAaCGTGGACaCGaCGtTGCATagCaTGTTG 

1 1 ii mm urn iiiii n i i m i 

471 gAAAAacgACGATTtCACAACATAGctgggcactataGAgGecaGtgccattcgTGCtgeaaccGagc 

— ctaaaggacg-tgGgcagggcg-tgcatggct-Gttg 
AUS6-AAATGTATAGATTTTTATTC-AUS6 

C65-CAACCGAGC 

O10-GAAAAACGACGATTTCACAACATAGCTGGGCACTATAGAGGCCAGTGCCATTCGTGCTGCAACCGAGC-O10 
02 4 -AAAAAGCAAAGATTCCATAATATA AGGGCTCGGTGGACCGGTCGATGTATGTCTTGTTG-02 4 

6 512 CcTACACTGC TGGACAACATGCATG GAAGACaTGT 

i 1 1 1 1 1 1 1 1 iimimimii mm m 

11 512 CtTACACTGC TGGACAACATGCATG GAAGACtTGT 

i IIIII II I II Mil II 

33 528 gaggtceegACGTAGACAAACTGCactgtgAcgTGTAAAAacgcCATGa.gagGACACaagcC 

MIMIMIIII I Mill MM MM I 

16 52 3 cagatcatcAAGAaCACGTAGAGAAAC CCAgcTGTAA tCATGCATGGACAtACAC C 

mi i m i iiiii mi iiiii inn mm mi i 

31 52 7 GagAAGACCtCGTActCAAAC CCAagTGTAA aCATGCgTGGAGAaACAC C 

I MM I I II I I I II I II I 

18 539 acgacaGgaAcGACtcCaacgacgcAgagaaacaCaAgtataAtattAaGtaTGcAtggACctaaggC 

con — ga — gagaagaceacgta-aga-Actgca ocaggtgtAaaacatgcaTCgagagAcacaaggc 

C64-GAACACGTAGAGAAAC CCAG-C64 
ACGACAGGA-C65 
C 6 6 -GAGGTCCCG ACGTAGAGAA-C 6 6 
0 1 0 - ACGAC AGGAACGACTCCAACGACGCAGAGAAACACAAGTATAATATTAAGTATCCATGGACCTAAGGC-0 1 0 
02 4 -CAGATCATCAAGAACACGTAGAGAAAC CCAG-02 4 

€ 547 TACCCTAAAGGA TATtGTAtTAGACCTGCAaCCTCCaGACCCTGTAGGGTTACATTGCTATG 

immiim mi iii u ii m ii 1 1 iiiii mmiimiMMmmiii 

11 54 7 TACCCTAAAGGA TATaGTAcTAGACCTGCAgCCTCCTGACCCTCTAGGGTTACATTGCTATG 

1 1 lllllll II MM MMII II I Ml MIIIM 

33 590 aAC g TT AAAGGA ATATGTtTTAGA TTTatATCCTCAaCCAACTCAcCTATACTGCTATG 

ii ii iii mil mil m i ii ii iiiiiii 1 1 him mi 

16 57 9 TACaTTGCAtGA ATATaTGTTAGA TTTCCAACCaGACaCAACTGAtCTCTACTGTTATG 

III Mill II Ml lllllll MIIIIIM Ml lllllll Ml IMMIIM 

31 577 TACgTTCCAAGAC TATgTGTTAGA TTTGCAACCtCAGgCAACTGACCTCCACTGTTATG 

ii immiim i iiiii i i mm i mi i i 

18 607 aACaTTGCAAGACattgtaTtgcatTTAGAgccccaaaAtgaaattcCggtTGACCTtCtaTGTcAcG 

con tAC-tT— AgGAc at-tgt-tTAGAcctt catcc-ga-cCa— tGaccTacacTG-tAtG 

BE 1 6 -ACCAGAGACAACXGAXCXCXACXCX-BE 1 6 
BE18-GXXAGAXXXGCAACCACAGACAACXGAXCXCXAC-8E18 
0 1 0-AACATTGCAAGACATTGTATTGCATTTAGACCCCCAAAATGAAATTCCGGTTGACCTTCTATGTCACG-0 1 0 

C89-G 
C90-G 
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6 609 AGCAATTAGtAGACAGCTCAGA AGATGA GGTGGACgAaGTGGACggACAAGAttCACAACCT 

iiiiiiiii illinium nun mini i nun nun mum 

11 60S AGCAATTACaAGACAGCTCAGA AGATGA CGTGGACaAGGTGGACaaACAAGAcGCACAACCT 

imim iiiiiimn iiiii nm i n mi immi 

33 649 AGCAATTAAgTCACAGCTCAGAtCACGATCAaCgctTGGACcGGCCAGATGGACAA CCACAACCa 

iiiiiiiii minium inn n i i n i mi iiimi in mi 

16 63 8 AGCAATTAAaTGACACCTCACAgGAGGAGGATGaaATAGAtgGTCCAGCTGGACAA GCAGAACCG 

mum minimi imiiiiii nm mnmmm mmm 

31 636 AGCAATTAccCGACACCTCAGAtGAGGAGGATGtcATAGAcaGTCCAGCTGCACAA GCAGAACCG 

mum mi iiiii mi nm i i i 

18 675 AGCAATTAagCGACteagagGAaGAaaAeGATGaaATAGA tggagttaateatcaacatttAcCaG 

con AGCAATTAa — GACagctcaGAtga-gAtGAtga-aT-GAc-gg-c-gatggacaagacgcacAaCcg 

AGCAATTAGWAGAC-C 8 9 BE8-GACGAAGXGGACGGACAAGAXXC-BE8 

AGCAATTAARVGAC-C2Q BE9-GA GGXGGACGAAGTCGACGGACAAGATTCACAACC-BE9 

BE13-XGA GGXCCACAAGGXGGACAAAC-BE13 

BE 1 3 -CAGAACCG 
BE 1 7-ACAAGCAGAACCG 
C6 2 -CC AAGTCGACGCACAACAT-C 6 2 
C 6 3 - C AAGGTGGACAAACAAC AC G-C63 
01 O-AGCAATTAAGCGACTCAGACGAAGAAAACGATGAAATAGA TCGAGTTAATCATCAACATTTACCAG 

6 671 TTAAaACAACATTtCCAAATAgTGACCTGTTG CTGTGGATGTGAC AGCAACGTtCGA 

mi ilium iiimi iiiiiiiii! uunimui ilium m 

11 671 TTAACACAACATTACCAAATAcTGACCTGTTG CTGTGGATGTGAC 

ill nm i u i mum in u 

33 714 GcCACAGCtgATTACtAcATTGTAACCTGTTGT caCActTGTaAC 

i ii iii iiiii i m i m ii 1 1 1 mi ii in n 

16 7 03 GACAgAGCCcATTACAATATTGTAACCTTTTGTTG CAAGTGTgACT 

mi i ii iiiiiiiiu ii iiummi nm i i 

31 701 C-ACAcAtCCaATTACAATATCGTtACCTTTTCTTGT cAGTGTAAGT 
I II I I III II llllll IIIII}} 



AGCAACCTcCGA 
I II 11 II 
ACCACaGTTCGt 
III llll 
CTACgCTTCGg 

mi inn 

CTACACTTCGt 
IIIII 



18 7*1 ccCgacgogccgaACcAcAaCGTcACacaaTGnGTgtatgtgtTGTAAGTgtgaagCcAgAaTTgag 
:on g-eaeag — catt«CcA-At-gT-ACctgtTGttgt-ctg — g-TGT-Act accAcagTteg- 

GACAGAGCCCAX-BE 15 BE 1 9 - AGXGXGACX CXACGCXXCGG 

GACAGAGCCCA-BE 1 7 BE2 0-XXGCAAGXGXGACX CXACGCXXCGG 

BE24-XXCXAAGXGXGAAGCCAGAAXXGAG 
BE25-AXGXGXXGXAAGXCXGAAGCCAGAAXXGAG 
0 1 0 -CCCGACC-AGCCGAACCACAACGTCACACAATGTTGTGTATGTGTTGTAAGTGTGAAGCCAGAATTGAG -0 1 0 
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7 2 B CTGGTTGTGcACTGtACAGAaacAGACATCAGAgAAgTgCAAcAgCTTcTGtTCGGaACACTAAAcAT 

iiiiiiiii mi inn iiiiiniif ii i iii i iii ii nil mum n 

728 CTGGTTGTGgAGTGcACAGAcgCAGACATCAGAcAAcTACAAgAcCTTtTGcTGGGCACACTAAATAT 

i iii i i mi i iii i ii mi i ii i inimi i inn 

771 TT»TGTGTcaAcACtACAGcaaGtGACc7aCGAACcaTACAgcAaCTacTtATGGGCACAgTgAATA.T 
II II II I II III III I II II I I I II I IIIIIIIII I II 

760 TTGTGcGTACAaAGCACACAcGTAGACArTCGtACtTTGgAAGAcCTGTTAATGGGCACAcTaGGAAT 

inn inn nniin inn inn i in nn inininni n i inn 

758 TTGTGtGTACAgAGCACACAAGTAGAtATTCGcAtATTGCAAGAGCTGTTAATGGGCtCAtTtGGAAT 

i in i in n ii n mi in n iniiii n i i i i 

809 cTagtaGTAgAAAGCtCAgcAGaeGAccTTCGagcATTcCAgcAGCTGTTtcTGaaCaCccTgtcctT 
-Tg — tGTacAgaGcaCAgaag-aCAcaTtcGaacatTgcAa-AgCTgtT-aTGggcaCacTaaa-aT 
XXG-8E19 BE29-AGCAACXGACCXACGAACCAXACA-BE29 C42-CCCCTCTGAYYYDTA 
XXGXGCGXAC-BE20 C43-CTTGTCCCACAGGAA 
CXAGX-BE25 

BE30-AGXACACCAAGXGACCXACGAACCAXACAGCAACX-BE30 
OlO-CTAGTAGTAGAAAGCTCAGCAGACGACCrrCGAGCATTCCAGCAGCTGTTTCTGAACACCCTGTCCTT-OlO 

796 aGTGTGTCCCATCTGCGC AC CgAAgaCcTAACAAcCATCGCCCACCATTCAGGTACAGAAAAT 

iiiiiiiiiiiimii ii mii nun n nnnminiiiininnni 

796 TGTCTGTCCCATCTGCGC AC CaAAACCaTAACAAgGATGGCGGACGATTCAGGTACAGAAAAT 

nun n i iii ii n nn i i i inn n i mini i 

839 TGTGTGCCCtAcCTGTGC ACAacAAtAAACATCAtCtAcaATGGCcGATcCTGaAGGTACAaAtGgg 

lllllllll I till I II II I III I I I III I I 

828 TGTGTGCCCCAtCTGTTCT CAgAAACcATAATCTACcATGGCTGATCCTCCAGGTACcAATGGGGAa 

1 1 1 1 ii i i n i i it 1 1 1 i ii iii nn minimi mmn mini 

826 cCTCTCCCCCAaCTGTTCT aCtAgACtGTAA CTACAATGGCTGATCCAGCAGGTACAGATGGGGA 

mn n m i i i nn i mnniiimm iiiiiiiii inn 

87 7 tGTGTGtCCgtggTGTgC otCccagCaGTAAgCaACAATGGCTCATCCAGaAGGTACAGAcGGGGA 
tGTGTC-CCcatcTGtgCtaca-aaacaataotcaaCaAtg— — G-t— g~gg — -ta-ag-ggat 
C 4 Q - CACACRGCGTAGACRCC -CiO C75-ATGGCKCAVCCTGMAGGX'AC-C75 
C41-CACACAGCCACCACACC-C41 C76-ATGGCXGAYGATTCAGGTAC-C76 
ACACAC-CH C 7 7- ATGGCKGAYCCTTCAGGTAC - C 7 7 

A CACAC- C 4 3 C 8 1 - TA CCGMCTRGGA CKTCCATG-CSl 

C82 - TACCGMCT%CTAAGTCCATG-CQ2 
C8 3 - TACCCMCrflCCAACrCCV) TC-C 8 3 
Old -TGTGTGTCCGTGGTGTGC ATCCCAGCAGTAAGCAACAATGGCTGATC-C'l 0 

85 9 GAGGGGTCtCCCTGTACAGGATGGTTTATGGTAGAAGCtATAGTgcAaCACcCaACAGG TAC 

mmn iimmmiimmmiiiiui mn i m i mn in 

859 GAGGCGTCGGGGTGTACAGGATGGTTTATGGTAGAACCcATAGTAGAGCACACt ACAGG TAC 

i m nmim ii 1 1 1 1 1 1 mmm i mn i mn 

906 GctGGGAtGGGGTGTACTCGtTGGTTTgAGGTAGAAGCaGTca.TAGAGAgAAgAACAGG aGA 

i ii i iii mi m imii i mn n n mm ii mm n 

895 GaCGGtACGGGATCTAATGGaTCGTTTTATGTAGAgGCtGTAgTgGAaAaAAAAACAGG GGA 

m mum mn iimiiiiimi n m mm m mn in 

891 GGCgACGGGATGcAATGG tTGGTTTTATGTAGAAGCaGTAaTtGACAgAcAgACAGG GGA 

m mil n ii ii mmiimi nn i i mi i i mn 

94 3 CGGcACGGGtTGtAAcCCcTGGTTTTATGTAcAACCtaTtgTaGACAaAaAaACAGGagatgtaat 

gagGGgacgGGgTGtA- tGGaTGGTTTta-GTAgAaGCt-TagTagA-aaaaaaACAGG a 

C7 9 -TGTAMWGGMTGGTTTTATGT-C7 8 _ 
C79-TGTAMWCCMTGGrrTGAGCT-C79 
CS0-TGTAHWGGMTGGTTTATCCT-C8O 
C 8 4 - A CA TKWC CKA C CAAAA TA CA - C 8 4 
C8 5 - ACATKWCCKACCAAACTCCA-C 8 5 
C8 6-ACATKWCCKACCAAATACCA-C8 6 



ss 
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6 921 ACAAATATCAGAcCATGACGAtGAGGAGGTGGAGGACAGTGGGTATGACATGGTGGACTTTATTGA'IG 

illinium mimi iiimimiiimimiiiiiiiiiimiiiiiiiiinii 

11 921 ACAAATATCAGAAGATGAGCAaGAGGAGGTGGAGGACAGTGGGTATGACATGGTGGACTTTATTGATG 

i it 1 1 1 1 1 m i n 1 1 1 1 ii t ii mum n i i u inn mi 

33 968 TaATATtTCAGAAGATCAGGAtGAAAcaGcaGATCAC^TGGcaegGATTTacTAGAgTTTATAGATG 

i in urn n iii i mi i mi n nm nn mini 

16 957 TgcTATaTCAGAtGACGAGAAcGAAAAtGacAGTGATACaGGtGAaGATTTGGTAGAtTTTATAGtaA 

ii nm mn ii in i mum n n in mi n nm i i 

31 951 caacATTTCACACC ACCAAAATG AAqAC AG tAGTG AT AC tGGGGAGGATATGGTfcGAcTTTATTGAc A 
I I I I I I I I I I II I I llll I II III I I I I I I f 1 I II llllllll I 

18 1009 ate a gaTgacGAGGACGAAAATG caACAG AcACaGGCtcCGATATCGTaGAtTTTATTGAtA 

con a-aaat-tcaGA-GA-CAg-AtGaa-a-g-ggatgAeA-tCGgtagGAtaTggTaGAcTTTATtGat- 

6 989 A CACcaATATTACA CAcAATTCaeTGGAAGCACACGCATTCTTTAAcACGCAGGAGGCG 

i m nmin n nm mmmnnnmmi minium 

11 989 A CAGgcATATTACA CAAAATTCtGTGGAAGCACAGGCATTGTTTAATACGCAGGAGGCG 

i i i nn ii i mi m i minimmi mn i i 

33 103 S ATtcTAtgCAAaATAgtATACAGGCAGAeACACAGGCAGCCCgGGCATTGTTTAATAtACACGAAGgG 

ii ii n ii i immn mm nn i n iiinii i mmn 

16 1025 ATgaTAATCAtTATttaAcACAGCCAGAAACAGAGACAGCACAtCCgTTGTTTAcTCCACACGAAGCa 

ii mn ii i iiiiiiii! immmni n nini nimimii 

31 1019 ATtgTAATGtATAcaacAAtCAGGCAGAAgCAGAGACAGCACACGCATTGTTTeATGCACAGGAAGCg 

ii i iiiiiiii iiimiiimiiiiimi nm nm i 

18 107 1 cacaaggaacATtttgtgAaCAGGCAGAgctAGAGACAGCACACGCATTGTTcCATGCgCAGGAgGtc 

con attat&atgcatatataataCAggcagA — eagaG-cAGCaCagGCaTTGTTtaat-c-CAGGA-Gcg 

6 104 8 G AcaCcCATTATGCG ACTGTGC AGGACCT AAAACGAAAGT ATT TAGGt AGTCCATATGTt AGTCCTAT 
II I I I I I f i t 1 I I ( I I 1 N I 3 1 M I I t I M ! 1 II I I i I [ I I I I I I I I I I I f II I 1 IIIIIIII 

11 104 8 GAtGCTCATTATGCGACTGTGCAGGACCTAAAACGAAAGTATTTAGGcAGTCCATATGTaAGTCCTAT 
II I I III fllll I I I I I I I 1 I I I I I J I I 

33 1104 GAgGATgATTtAaATGCTGTGtgtGcaCTAAAACGAAAGT TTGCCgc 

i i ii i iiii ii i miiiiiimi I I I 

16 1093 aAacAACATagAGATGCaGTaCAGGTTCTAAAACGAAAGT AT TtGGTAGTCCa 

i nm m ii ii iminiimmim n i iiiiiiii 

31 1087 gAggAACATGCAGAgGCtGTGCAGGTTCTAAAACGAAAGT ATgToGCTAGTCCt 

i i nm i i mi m mmnim 

18 1139 cAcaAtgATGCAcAaGtgtTGCAtGTTtTAAAACGAAAGT ttgcaggaggcagcacaga 

con gA-gatcATt-agaggctgTgcagGttcTAAAACGAAAGTatttagg-agtcca — tgtga-tgcc-t 

BE 1-XAAAACGAAAGX-BEl 
BE2-AGGACCXAAAACGAAAGXAXXXAG-BE2 
BE3-AGGXXCXAAAACGAAAGXAXXXGG-BE3 
BE4-AXGXXXXAAAACGAAAGXXXGCAG-BE4 
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6 1116 AAaCAcTaTAGCegAgCCACTgCAAAGTGAaATAACTCCACGaTTgCACGCCATTAAACTTACAAgAC 

i! ti i mi i mi! iiiniii minimi n mmmmimm n 

11 1116 AAGCAaTgTAGCTaATGCAGTaGAAAGTGAgATAAGTCCACGgTTaCACCCCATTAAACTTACAAcAC 

II I I III I II II II I I I II I 

33 1151 ATCttcacaAagXGcTCCgGagCAcgVrGTtCAtcGTgeTgcaAacCCgtGtAgAAcgtCTATtAaTA 
I II I III 1 1 II I I I I II Mil II 

16 2146 cTtAGTGATATTAG TGGaTGTGTaCAcaATAATATTAGTCCtaGarrAAAAGCTATATGTA 

i mi i m ii i i i m n ii mm mi m i mini mini i 

31 1141 tTAAGTGATATTAG TaGtTGTGTCGATtATAATATTAGTCCACGGTTAAAAGCTATATCcA 

i inn linn i i iiiiiiiiiiii in nn 

18 1198 aaAcagtccATTACgggagcggctggagGTGGATacagAgtTaAGTCCACGGrrAcAAGaaATATctt 
con a-aca-tatAfctagaggcagtggaa-gtGtggatagt — t-taagtcc — g — taaaagctAta-gta 



6 1184 AGCCAAAAAAGGTAAAGCGACGGCTGTTTcAAACcaGGGAAcTAACGGACAGTGGATATGGCTATTCT 

iimmimiimimmnm nn inn iimnnmmmnmni 

11 1184 AGCCAAAAAAGGTAAAGCGACGCCTGTTTgAAACAcGCCAAtTAACGGACAGTGCATATCGCTATTCT 
I III I I I III I II II Hill MIMI! || || 

33 1219 AAaAtAAAgAAtCcAcAtacaGAAAAcGaAAAATAgaTGAgcTAGAACACAGCCGATATGCCAATACT 

i i iii n i i i mi i ii n m minimi iimmmi 

16 1207 TAGAAAAAcAAAGTAgAgCtGCAAAAaGgAGAtTATTTGAAagcGAAGACAGCGGGTATGGCAArACT 

imm i nn i i mm i m i mm nmnmmnmmi 

31 1202 TAGAAAATaAcAGTAAAACaGCAAAAcGflAGACTcTTTGAAcTtcCAGACAGCCGGTATGGCAATACT 

ii i i i mi mm i i n in n nn n n mm i n 

18 1266 TAaAtAgTgggcagAAAAagGCAAAAaGgcGgCTgTTTacAaTatCAGAtAGtGCcTATGGCtgTtCT 

con -a-aaAaaaag-g-Aaa — ag-aaaa-g-a-aatatt.tgaacta-caGAcAG-GCaTATGGC-aT-CT 

JJ3-tatggctattct 
C81-ATACCGTTAWGA 
C88-ATACCGA1-AWCA 



6 1252 GAAGTGGAAGCTGgaacgggAACG CAGGTACACAAACA 



iiiimiimi 

11 12 S 2 GAACTGGAAGCTG 

inmm ii 

33 128 7 GAAGTGCAAACT 

minmm 

16 127 5 GAAGTGGAAACT 

mimim 

31 127 0 GAAGTGGAAAC 

mmin i 

18 1334 GAAGTGGAAgC 



mi nmimnm 

CAACG CAGGTAGAGAAACA 
I till llll 
CAGCAGAT GGTA 

mum i 

CAGCAGAT G 

mum i 

gCAGCAGAT G 
lllll 



III 
tTACA 

llll 
gTACA 

llll 



I I 
GGTAG 
lllll 
GGTAG 

nm 

GGTAG 
llll 



aacaCAGATtcaggt&acTACAaatggcgaacatggcggcaatgtatGTAG 



GAAGTGGAA-Ctg gca-caCataggtagag-- 

gaagtggaagctgnnnnncnacagat-JJ3 

CTTCACCT-C87 

CTTCACCT-C88 



GtaG 
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6 129 5 taCCGGAAAATGG GCGAGATGGTCAGGAAAaGGA 

lllllllllll llllllillllllllll III 

11 1289 A CCCGGAAAATGC CCCAGATGGTCAGGAAAgCGA 

I Illllll II III II 

33 1321 A — AagtcAAAATGG cgACAC AaActtaaAtGActtaGA 

I I I llll I III 

16 1306 A A gggcGccatgagactgAAACACcAtgtagtcAgtAtagtGg 

, i iiiii 

31 1301 A GGAG CAAC AAACA AC 

tin iiii mi it 

la 139 6 tggcggcagtacCCACgctatagaCAACgggggcacagagggcAACA AC 



con a — ■ ggag aacgcaaaatgg aga — gaaacacgagatggtcaggaaaggga 

6 1329 CACAGGaAGGGACATAGAGGG GGaGGAACATAcAGAGGCGGAAGCgcccaeaaACAGtgtaC 

nnn iimiimmi n nmm iiiiiiiiiiii iim i 

11 1323 CACACGgACCGAcATAGAGCGTgagCgGGTCGAACATAgACACGCCCAAGCagtagacGACAGcaceC 

I I III II I I I II I I II III. 

33 1358 atCtAGTGGGGtgGGgGAtGaTtcaGAaCTaAGctGTgagacaaatGtAGaTagctGTGAAA 

mini ii i i ii i iii mm 

16 1349 tggAAGTGGCCgtCGttgcagTcagtAcaCTAGTGGaagtggCGGAGagCgTgtTAGTGAAAGAeAcA 

iiii i mm mi mini n i i 

31 1317 AttAAGT tgtaATgGTAGTG ACGCGA cAcATAGTGAAcGAgAgA 

I I llll II II I II I 

18 14 45 A gcagtgtagacggTacaAGTG AC aAtAgcaatAtaGAaAat 

con a-caagtagggacagaga-ggt-agga-g — agtgataga-cgg — gaagca agtgAaaga-a — 

6 1391 GgGACCATGCACgCACAgCACGAATAT TgGAATTgtTAAAATCTAAaGATtTAC GggCagCATT 

i iiiiiiiiii mi iimnii I IIIII I t f I I I 1 I I I III III I I llll 

11 1391 GttGAGCATGCAGACACAtCAGGAATAT TAGAATTacTAAAATGTAAgGATATAC GAtCtACATT 

ii mm II III I I I I I llll III I 

33 1420 atgttACgttgCAGGAA AT TAGtAATGTTCTACAtAgTAGTAATACAAAACCAAatAT 

iiiii iiiii in i i mini iiii iii ii 

16 1417 cTatAtgcCaAACACcacttacAA ATATTTTaAATCTaCTAAAAACTAGTAATCCAAAgGCAGCaAT 

i i i inn i iiii ii i ii iiiiiiiiii nn ii n n n 

31 1361 aTgAAaC tCCAACAC GtA ATATATTgcAaGTGTTAAAAACXAGCAATGgtAAAGCtGCTAT 

I ii i in I I i ii I miim i mi m inn 

18 14 8 7 gTaAAtCcaCAAtgtaccataGcACAatTAaaagActTGTTAAAAgtaAaCAATaaacAAGgaGCTAT 
con gtgaa — t-caa-c-ca-caggaAtAtattagaaatgtt-tAaaaa — ag-aaTacaaaagcagc-aT 

6 1455 AC tTGGTAAGTTTAAAGAaTGCTTTGGGCTGTCtTTTalaGATTTAATTAGGCCATTTAAAAGTGATA 

ii iimiiimmi imimimii m i imminmiimmimiii 

11 1455 ACaTGGTAAGTTTAAAGAcTGCTTTGGGCTGTCaTTTgTtGATTTAATTAGGCCATTTAAAAGTGATA 

i m iiiiiiii ii m i in i ii m i n miiimiiimi 

33 14 78 ATTAtaTAAATTTAAAGAGgcCTATCCaaTaAGTTTTaTgGAATTAGTAAGACCATTTAAAAGTGATA 

m 1 1 i i f ! 1 1 1 1 1 1 ii ii i mm i n 1 1 1 1 1 1 1 1 1 m 1 1 1 n 1 1 1 1 i in 

16 1484 GTTAGcaAAATTTAAAGAGTTATAcGGgGTgAGTTTTtcaGAATTAGTAAGACCATTTAAAAGTAATA 

nm i m 1 1 1 1 1 1 1 1 inn ii ii miii m n i n mm mi nn 

31 14 22 GTTAGgtAAATTTAAAGAaTTATATGGtGTAAGTTTTAtgGAAcTAaTTACgCCATTTcAAAGcAATA 

nm in iiiii mm n mm ii ii mi iii iiii iii 

18 155 5 GTTAGcagtATTTAAAGAcacATATGGgcTAtcaTTTAcaGAttTAgTTAGaaatTTTaAAAGtgATA 
con -ttaggtaaaTTTAAAGA-t- 
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6 1523 aLAACaACATGTttaGACTCGGTGGTaCCAGGgTTTCGTATACATCATAGCATAtCAGAgGCATTTCAA 

in linn milium inn nmmmmmim mi immii 

IX 1523 gAACcACATGTgCcGATTGGGTGGTtGCAGGATTTGGTATACATCATAGCATAGCAGAtGCATTTCAA 

m i m i mm i mm in n n mini n n 

33 154 6 AAACAAgcTGTaCcGATTCCTGTATaaCAGGATaTGGAATTAgTCCatcagTAGCAGAaAGTTTAAAA 

ii mi iii mimim i i it mi in n mi n in inn 

16 1552 AAtCAACgTGTtgcGATTGGTGTATtGCTGCATrrGGAcTTACACCcAgtaTAGCtGAcAGTaTAAAA 

ii mmi mmm i inn mm i immii mi 

31 1490 AAAgCACaTGTACtGATTGGTGTgTAGCTGCgnTGGAGTTACAggtACAgTtGCAGAAGGATTTAAA 

, m iii mmi i i mi ilium i in i miimiimiimi - 

18 1623 AAAeCACgTGTACaGATTGGgtTacAGCTataTTTGGAGTaAacccaACAaTaGCAGAAGGATTTAAA 



6 1591 AAaTTAATTGACCCATTAAGTTTATATGCACATATACAATGGCTaACAAATGCATGGGGAATGGTAtT 

ii MmimimmmmmmimiiiiiMM niimiiimiMiMii i 

11 1591 AAgTTAATTGAGCCATTAAGTTTATATGCACATArACAATGCCTtACAAATGCATGGGGAATGGTAcT 

MMM I I Mill Ml I III IMMII III I I Mill II I 

33 1614 gtATTAATTaAACAgcATAGTTTgTATaCtCATtTACAATGTTTAACtTGcGataGaCGAATaaTAaT 

i ii i mi ii mi m ii i m inn i n i inn i i 

16 1620 ACACTATTaCAACAATATTGTTTaTATTt aCAcaTtCAAAGTTTAGCATGTTCaTGGGGAATGGTTgT 

ii mmi mi iiimm mi n i 11111111111111111 mmi mm i 

31 1 S 5 8 ACcCTATTgCAACCATATTGTTTgTATTgCCATtTaCAAAGTTTAGCATGTTCcTGGGGcATGGTTAT 

II III I II III I I II Ml 1 1 1 1 I Ml M Ml Ml Mill I I II 

18 16 91 ACaCTAaTaCAgCCATtTataTTaTATgcCCATaTtCAAtGTcTAGacTGTaaaTCGCGagTatTaAT 
con aca-TAaTtcA-Ccat-tagtTTaTATgcaCAtaTaCAAtGt-Ta-catgtgcatGgGGaaTggTaaT 
6 1659 gTTAGTATTAtTAAGaTTTAAAGTAAATAAaAGtAGAaGTACCGTtGCACGTACAcTtGcaACGcTAT 

Mimiii i ii i Minimum n in mini mmm i i in in 

11 165 9 ATTAGTATTAATAAGgTTTAAAGTAAATAAgAGCAGATGTACCGTgGCACGTACAtTaGGTACGTTAT 

mi i mn ii mi i n i m i n n in i i m mi 

33 1G82 ATTAtTgTTAATtAGATTTAggTGTAGcAAAAACAGgTtAACAGTaGCAAAAcTAaTGaGTAAtTTAT 

m i iii i mi ii iii i mn ii mi i i im i n in m 

16 1688 GTTAcTaTTAGTaAGATaTAAATGTGGAAAAAATAGAgaAACAATTGAAAAATTgcTGtcTAAAcTAT 

„ ie e mi i i n mi iiiiiiii 1 1 1 ( 1 1 1 1 1 1 minimum i in in 

31 162 6 GTTAaTgCTtGTGAGATtTAAATGTGcAAAAAATAGAaTAACAATTGAAAAATTaTTAgaaAAATTAT 

„„ m ii ii i i mini n i mi mn in m m i n i 

18 175 9 aTTAgccCTgtTGcGtTacAAATGTGgtAAgAgTAGAcTAACAgTTGctAAAggtTTAagtAcgTTgT 
con -TTAgtatTa-TaaCaTttAaatgt-gtAAaA-tAGa-taACagTtCcanaa-tatTaggtA-gtraT 

6 172 7 TAAATATACCTGAAAAcCAaATGrTAATaGAGCCaCCaAAAATACAAAGTGGtGTtgcAGCCCTGTAT 

mmiimnm n mum mn n nmiiimm n mimm 

11 172 7 TAAATATACCTCAAAAtCAcATGTTAATtGAGCCtCCtAAAATACAAAGTGGcGTAcGAGCCCTGTAT 

ii mimm m i ii mn n in m n i i n inn 

33 175 0 TATcaATACCTCAAAcaTGTATGgTtATAGAGCCaCCAAAATTACGgAGccaAaCAtGtGCATTGTAT 

in i ii ii mm i iiiiiiii iiiiiiii ii ii I n mn iii 

16 1756 TATGTgTgTCTcCAAtgTGTATGaTgATAGAGCCtCCAAAATTgCGTAGtACAGCAGCAGCATTATAT 

mn i m m 1 1 1 m i ii imii him inn mn mimm n 

31 16 9 4 TgTGTaTATCTaCAAaTTGTATGTTAATTCAGCCACCcAAATTaCGTAGcACACCTGCAGCATTATAT 

i ii 1 1 ii minimum mn urn n ii i i mim mn 

18 1827 Tacac gTAcCTgaAAcTTGTATGTTAATTC AaCCACCaAAATTgCC aAG t Ag tG tTGCAGCAcTATAT 
con TatataTacCTgaAAattgtATGtTaAT-gAgcCaCCaAAAtTaCgaAGtagag — gcaCCa-T-TAT 
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6 179 5 TGGTTTcGtACAGCtATaTCAAATCCcAGTACAGTTATAGCGGAaCCaCCaGAATGGATAACaCGCCA 

nun i inn ii imiiii iiiiiiiiiiiiiiiii ii ii minimi mm 

11 179 5 TGGTTTAGgACAGGCATtTCAAATGCaAGTACAGTTATAGGGGAgGCgCCgGAATGGATAACgCGCCA 

mum mi ii urn in n m i n niiiiui i i 

33 1818 TCGTTTAGAACAGcaATgTCAAAcATTAGTGAtGTacAAGGtacaACaCCtGAATGGATAgAtAGACt 

mi ii inn" ii inn iiiiiiii ii i ii ii ii mimii i mi 

16 182 4 TGGTATAaAACAGGtATaTCAAAtATTAGTGAaGTgTATGGaGAcACgCCAGAATGGATAcAAAGACA 

inn i mill ii mn inn n u urn n n immmii mini 

31 17 62 TGGTAcAGAACAGGAATgTCAAAcATTAGcGAtGTATATGGtGAaACACCACAATGGAIAgAAAGACA 

inn minimi mn mn n in n n inn n mill mm 

18 1895 TGGTAtAGAACAGGAATaTCAAAtATTAG tG AaGTAatgCCaGAcACACCtGAgTGGATAcAAAGACt 



6 1863 aACaGTTATTGAACAcgGgTTGGCaGACAGTCAgTTTAAATTAACaGAAATGGTGCAGTGGGCgTATG 

n 1 1 1 1 1 1 1 f 1 1 1 i mn iiiiiiii 1 1 1 1 n 1 1 1 1 1 mmiiimmn mi 

11 1863 gACeGTTATTGAACATAGtTTGGCTGACAGTCAATTTAAATTAACTCAAATCGTGCAGTCGCCATArC 

ii iii i mimii n iii i i mi i nn mimii mnniimi 

33 1886 AACtGTTTTACAACATAGcTTTAATGATAaTAtATTTGAtTTAAgTGAAATGGTACAGTGGGCATATG 

iii ii milium immmii h mm in i iiiiiiii mn n i 

16 1892 AACAGTATTACAACATAGTTTTAATGATtgTACATTTGAaTTATCaCAgATGGTACAATGGGCcTAcG 

1 1 1 1 n 1 1 1 1 ii mmimiiii iiiiiiii n n n inmnmni n i 

31 1830 AACAGTATTACAgCATAGTTTTAATGAeAc&ACATTTGATTTGTCcCAAATGGTACAATGGGCATATG 

ii i mi in i i nn i i minimi nminnmimi n 

18 1963 tACtaTtaTACAaCATgGaaTagATGAtAgcAatTTTGATTTGTCagAAATGGTACAATGGGCATtTG 
con aACagTt-TaeAaCAtaGttTt-atGA-agtaaaTTTgA-TTa-cagAaATGGTaCA-TGGGCaTatG 
aacNgttatacaacatagtttHgatgat-JJ6 

6 1931 ATAATGACATaTGcGAgGAgAGTGAaATtGCATTTGAATATGCACAaaGgGGAGAtTTTGAtTCtAAT 

iiiim n n ii ii inn n mmmnmin i mn mn n in 

11 1931 ATAATGAtATtTGtGAaGAaAGTG AgATaGCATTTG AATATGCACAgCGTGGAGAcTTTCAcTCcAAT 

mi ii i n n mn n mi i i iiiiiiii i ii m i i m 

33 1954 ATAAcCAgtTAacgGACGATAGTGAcATTGCATATtAtTATGCACAACTTGCAGAtTCaAATAGTAAT 

mi n ii minimi mimii i mimn i mn i mnim 

16 1960 ATAATGAcaTAgTaGACGATAGTGAAATTGCATATAAATATGCACAATTgGCAGACACTAATAGTAAT 

I lllll I I II I I I I I I I II M I I I 1 i 1 I I I M M t 11 M M II llll I IIIIIIII 
31 1898 AcAATCAtgTtATgGATGATAGTGAAATTGCcTATAAATATGCACAATTAGCtGACAGTgATAGTAAT 

i nm i i mn n ii nun mini mm mn i n in 

18 20 31 AtAATGAgcTgAcaGATGAaAGcGAt ATgGCaTtTgAATATGCctt ATTAGCaGACAGcaAcAGcAAT 
con AtAAtGA-aTaa— GA-GAtAGtCAaATtGCaT-TgAaTATGCacaatt-GcaGAc— t-AtagtAAT 

6 19 9 9 GCAcGaGCaTTTTTAAATAGcAATATGCAGGCaAAATATGTgAAAGATTGTCCAAcTATGTGtACACA 

iii i ii niimmi imimiii iiiiiiii imiimnn mm mn 

11 19 99 GCAaGgGCcTTTTTAAATAGTAATATGCAGGCtAAATATGTAAAAGATTGTCCAATTATGTGcAGACA 

ii ii mum mn n n m mn n mi m mn mn 

3 3 2022 GCtgcTGCaTTTTTAAAAAGTAAcTCACAaGCAAAAATaGTAAAGGAcTGTGgAATAATGTGTAGACA 

ii iii iii iimimi mn iiiiiiii iiiiiiii mi n minium 

16 20 2 8 GCAaGTGCcTrrcTAAAAAGTAATTCACAGGCAAAAATtGTAAAGGATTGTGcAACAATGTGTAGACA 
III llll III I I I I III I III I I III I III III I- II II HI HI I I III I II I I I I I III 
31 1966 GCAtGTGCaTTTTTAAAAAGTAATTCgCAGGCAAAAATaGTt AAAGATTGTGgAACAATGTGTAGACA 

iii iii minimi mi ii ii in i iiimim iiiiiiii i iii 

18 20 99 GCAgcTGCcTTTTTAAAAAGcAATTgcCAaGCtAAAtat tTaAAAGATTGTGccACAATGTGcAaACA 
con GCa-gtCC-TTTtTAAAaAGtAAttcgCAgGCaAAA — tgTaAAaGAtTGTGcaAcaATGTCtAgACA 
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6 206 7 TTATAAACATGCAGAAATGAggAAGATGTCTATaAAACAATGGATaAAacATAGGGGTtCTAAAaTaG 

iiiiiiiiiiiiiiiiiiii iiihiuiii milium it mum inn i i 

11 2067 TTATAAACATCCAGAAATGAaaAACATGrCTATtAAACAATGGATtAAgtATAGCGGTaCTAAAgTtG 

imm i imm ii iiiii n imimi i m n in 

33 2090 TTATAAAaAAGCACAAAAAcgtAAAATGTCaATagGACAATGGATAcAAagTAGATGTGAaAAAacAa 

mini imimii urn u i mmm n imimi i i 

16 2096 TTATAAACGAGCAGAAAAAaaACAAATGagtATGaGtCAATGGATAAAAtaTAGATGTGAtAggGTAg 

immimiiimii iinui m i n inn m mmm i it 

31 2034 TTATAAACGAGCAGAAAAACCACAAATGtCCATGgGACAGTGGATtAAAagTAGATGTGAcAAAGTta 

-mil urn mmitmm in mmm i mini in i 

18 2167 TTATAggCGAGCccAAAAACGACAAATGaatATGtcACAGTGGATacgAttTAGATGTtcaAAAaTag 



6 2135 AagGcacAGGaAAtTCGAAaCCAATTGTaCAaTTcCTAcGACATCAAAAtATAGAATTcATTCCtTTT 

i i iii ii urn mum n n in minim uiiim iiiii m 

11 2135 AcaGTGtAGGtAACTGGAAgCCAATTGTgCAGTTtCTAAGACATCAAAACATAGAATTTATTCCATrr 

i ii iii ii mi iiiii u mil iiiii iimiim imm iiiii 

33 2158 ATGATGGAGGaAATTGGAgaCCAATaGTaCAGTTgTTAAGATATCAAAACATtGAATTTAcagCArTT 

imiiiiu mm i in ii m inn mm i n mi iiiii 

16 2164 ATGATCGAGGTGATTGGAaCCAAATtGTtAtCTTTTTAAGgTATCAAggtgTAGAgTTTATGTCATTT 

m i mm mi i i n n i iiiimii mm mi m imiiii 

31 2102 gTGAcGaAGGTGAcTGGAGGgAcATAGTaAAGrTTTTAAGATATCAACAAATAGAaTTTgTGTCATTT 

in i ii ii iiiii iiiii i ii i nn minimi in i inn 

18 2235 aTGAaGggGGaGAfcTGGAGaccaATACTgcAaTTccTgcGATACCAACAAATAGAgTTTaTaaCATTT 
con atgatggaGG— AtTGGA--ccaAr-GTacagTTt-TaaGatAtCAAaa-aTaGAaTTtat-- CaTTT 

atgatgg-aggaaattgga-JJll JJ12-cattt 

6 2203 nAACtAAAtTtAAATTATGGCTGCACGGtACGCCaAAAAAAAAcTCcATAGCCATaGTAGGcCCt.CC 

mi iii i imiimiimm mu iimm u mum iiiii n n 

11 2203 TTAAGcAAAcTaAAATTATGGCTGCACGGaACGCCcAAAAAAAAtTGTATAGCCATTGTAGGgCCACC 

hi i i i m i i i 1 1 i ii imm urn m u m i 

33 2226 TTAgGTGCATTtAAAAagTTTTTaaAAGGtATACCaAAAAAAAgcTGTATgcTAATrrgTGGaCCAGC 

iii mm nn urn mi iiiii mini n n n i i m mi 

16 2232 TTAaCTGCATTAAAAAgaTTTTTgcAAGGcATACCtAAAAAAAAtTGcATaTTAcTATATGGTGCAGC 

m minim mu nn i u u mu u u m u mum i 

31 2170 TTAtCTGCATTAAAgctgTTTTTAAAAGGAgTgCCaAAgAAAAAcTGTATtTTAaTAcATGGTGCAcC 

in ii urn iimumii u ii iiiii m i u i m n i 

18 230 3 TTAggaGCcTTAAAa tcaTrTTTAAAACGAaccCCc AAaAAAAAtTGTtTagTAtTtt gTGGacCAgC 
6 2271 aGAtACTGGGAAaTCGTaCTTTTGtATGAG TTTAATaAgcTTTcTaGGaGGtACAGTTATTAGTcAT 

ii mum mi nun iiiii mm i in i u u iiiiimmi n 

11 2271 tGAcACTGGCAAGTCGTgCTTTTGcATGAG TTTAATtAAGTTTTTgGGgGGaACAGTTATTAGTTAT 

i n ii mu i m i nm iiiiii mini u mu i i 

33 2294 aAAtACAGGaAAGTCATatTTTGGaATGAG TTTAATacACTTTTTaaAAGGGTgTGTTATATcaTgT 

ii mu ii mi mu mu mm i m i mm in mi i i 

16 2300 TAAcACAGGTAAATCATtaTTTGGt ATGAG TTTAATGAAaTTTcTgCAAGGGTcTGTAATATgtTtT 
III I I III I III III I II II! IIIII III III III I IIIII I I III III II 
31 223 8 TAATACAGGTAAATCATATTTTGGAATGAGccTTATTGAgCTTTtTACAAGGATgTaTAATATCATaT 

1 1 1 1 1 ( 1 1 ummummiiu un im mum mmm i 

18 2371 aAATACAGGaAAATCATATTTTGGAATCAGttTTAT acaCTTTaTACAACGACCagTAATATCATtT 
con -aAtACaGG-AAaTCaTatTTTgGaATGAG-tTTAataaacTTTtTacaaGGatc-gT-ATat — taT 
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6 2338 GTaAATTCC aGCAGCCATTTtTGG tTgC AaCC g tTAgt aGATGCtAAgGTaGCATTgTTaGATGATCC 

ii 1 1 1 1 1 1 uninm in i n ii n him n n him n imim 

11 2338 GTtAATTCCtGCAGCCATTTcTGGcTaCAGCCAcTAaCgGATGCAAAAGTgGCATTaTTgGATGATGC 

ii inn ii ii ii iii i mm n i imiiiii i i i i i mini 

33 2361 GTAAATTCTAAAAStCACTTTTGGTTgCAGCCATTAtCAGATGCAAAAATAGGAATGaTAGATGATGt 

1 1 1 it i ii 1 1 1 1 1 1 ii mum it mm mini mum m mmm 

16 2367 GTAAATTCTAAAAGcCATTTTTGGTTAC AAC C ATTAGCAG ATGC c AAAATAGG t ATGTTAG ATG ATGC 

i Mini urn immimimm i n urn iiiimi mini mini 

31 2306 GcAAATTCaAAAAGTCATTTTTGGTTACAACCAcTgGCtGATGCTAAAATAGGCATGTTAGATGATGC 

i urn i imnmiim mi i i m mi i i imimimm 

18 2438 GtgAATTCcAetAGTCATTTTTGGTTggAACCgtTaaCaGATaCTAAggTgGcCATGTTAGATGATGC 
con GtaAATTCcaaaAG-CAtTTtTGGtT-cAaCCatTagcaGATgCtAAa-TaG-aaTgtTaGATGATGc 

6 24 06 aACACAgCCATGTTGGAtATATATGGATACATATATGAGAAAttrgTTAGATGGTAATCCTATGAGtA 

urn imiimi mmmiimmiimii i mimiiiimimii i 

11 24 06 cACACAACCATGTTGGACATATATGGATACATATATGAGAAAccTaTTACATGGTAATCCTATGAGcA 
I I I I I II; I IN II ill I III II llllllll llllllll III II I 

33 24 29 aiACgCcAatAaGTTGGACATATATAGATGAtTAcATGAGAAATGCgTTAGATGGAAATgaAATTTCaA 

ii mm ii mum i i iiiiiiii ii mmm i mi i 

16 24 3 5 TACAgfcGCCcTGTTGGAAeTAcATAGATGAcaAttTAAGAAATGCATTgGATGGAAATttAGTTTCTA 

mi iii iiiiii i ii iiiii i i n imim i urn u n mi 

31 237 4 TACAACGCCaTGTTGGcAtTAtATAGAeAAtTAccTAcGAAATGCAcTAGATGGCAAcCCtGTATCTA 

ii ii i mm ii i ii i ii i mini inimm u n n 

18 2506 aACgACcaCgTGTTGGacaTActTtGAtAccTAtaTgaGAAATCCgtTAGATGGCAAtCCaaTAagTA 

-tA 



6 2474 TtGAcAGAAAgCATAaAGCATTgACATTAATTAAaTGTCCACCtCTgCTaGTaACgTCcAAcATAGAt 

i ii iiiii mi mm milium iiiiiiii ii n u n n n mil 

11 2 474 TAGATAGAAAACATAGAGCATTAACATTAATTAAgTGTCCACCgCTaCTgGTTACaTCAAATATAGAc 

iiiii iiiiiiii mm i i ii iiiiiiii ii ii mi mini i i 

33 2 49 7 TAGATGTgAAACATAGgGCATTAGTgCAAtTAAAATGTCCACCACTgCTtcTTACcTCAAATAcAAAT 

i iiiii ii inn mi n m mini n in i i mi n n i in 

16 2 50 3 TgGATGTAAAGCATAGAcCATTgGTaCAAcTAAAATGcCCTCCATTATTaATTACATCTAAcATtAAT 

i iimiimm i i ii i ii Himi inn iiiii iiiiimm n m 

31 2 4 42 TaGATGTAAAGCATAAAgCtTTAATgCAgtTAAAATGTCCTCCtTTATTgATTACATCTAATATAAAT 

i in iiiiii iii i iiiii ii mimiimi n i in i mm ii 

18 2574 TtGATagAAAGCAcAAAcCaTTAATaCAacTAAAATGTCCTCCaaTAcTacTaACcaCaAATATAcAT 

con TaGAt--aAAgCAtA-agCaTTaatacaa-TaAAaTGtCC-CCacTacTa-TtACatCaAAtAtaaAt 
6 2 54 2 ATTActAAAGAaGAtAAArAtAAgTATTTACATAcTAGAGTaACaACATTTACATTTCCAAATCCATT 

mi mil ii iiiii ii minim iiiiii ii mnnimmmmm 

11 254 2 ATTAGCAAAGAgGAgAAATAcAAATATTTACATAGTAGAGTtACcACATTTACATTTCCAAATCCATT 

m iii i ii iimimiiiim i ii mi mi mmm 

3 3 25 65 GCaGGCACAGAcTCTAGATGGCCATAITTACATAGTAGATTaACaGTATTTgaATTTaaAAATCCATT 
II II IIIII IIIII IIIII llllllllll IIIIII II Ml Mil III II 

16 257 1 GCtGGTACAGATTCTAGgTGGCCtTATTTACATAaTAGATTGGTGGTgTTTACATTTCCtAATgagTT 

ii mi m ii iiiii n iiiiii iii mini nnmini in n 

31 2510 GCAGGTAACGATgAcAGATGGCCATAccTACATAGcAGAcTGGTGGTtTTTACATTTCCAAATcCATT 

m mm i minimi u i u m i u m miuuii mi 

18 264 2 0CAGcaAAGGATaAtAGATGGCCATAttTAgAaAGtAGAaTaacaGTaTTTgaATTTCCAAATgCATT 
con 9c-ggtAaaGAtgatAgaTggccaTAttTAcAtAgt:AGA-TaacagtaTTTacATTTccaAATccaTT 
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6 2610 CCCtTTTGACAGAAATGCGAATGCAGTgTATGAACTgTCAaATaCAAACTGGAAATGTTTtTTTGAAA 
Ill tlllimilltlttmillt! IIIIIMi til II Itllllllllllllll iiiiiii 
11 2610 CCCeTTTGACAGAAATGGGAATGCAGTaTATGAACTATCAGATGCAAACTGGAAATCTTTcTTTGAAA 

iii inn nun n i it i mi i ii mi m iiiiiii ii ii ii 

33 2633 CCCATTTGAtGAAAATGG tAAcCCAGTGTATGcAaTAAATGATGAAAAtTGGAAATCCTTTTTCTCAA 

iiiimi i itt i ii ii minim t iiim t m miimmtmMii 

16 2639 TCCATTTGACGAAAACGGAAATCCAGTGTATGAgcTtAATGATAAgAACTGGAAATCCTTTTTCTCAA 

Minimi Mini mi imm! mil i i mm imiimiimimiiM 

31 257a TCCATTTCACAAAAACGGAAATCCAGTATATGAAtTAAgTGATAAAAACTGGAAATCCTTTTTCTCAA 

IIIIIIIII Mill II MIMMIIMim Ml Ml IIIM IIIIIII IMM II 

18 2710 TCCATTrGAtAAAAAtGGcAATCCAGTAtATGAAaTAAaTGAcAAAAAtTGGAAATgtTTTTTtgaAA 

con — CCaTTTGAcaaAAAtGG-AAtcCAGT-TATGaacTaaatgAtaaaAAcTGGAAAT — TTtTT AA 

6 2678 GACTGTCGTCaAGCCTAGACATTcAGGATTCtCACGA CGAGCAA GATGGAAGCAATAGCCAA 

iiiiiiiiii iiiMiiiim iiiiiii mil iiiiiii iimiMiiiiiiiiM 

11 2678 GACTGTCGTCCAGCCTAGACATTgAGGATTCAGAGGA CGAGGAA GATGGAAGCAATAGCCAA 

I II II II MM I II mm I Mill Mill I MM I 

33 2701 GGACGTGGTgCAaATTAGATTTaataGAGGAAGAGGA CAAGGAAAACcATCCACGaAATATCagc 

iiiiiiiii ii mi iii iii 1 1 imii Milium mm i i 

16 2707 GGACGTGGTcCAGATTAAgTTTGCACGAGGAcGAGGA CAAGGAAAACGATGGAGACTCTTTgcCA 

IIIIIIIII MINIM 1 1 II M II 1 1 II Mill III Ml 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II M 

31 264 5 GGACGTGGTgCAGATTAAATTTGCACGAGGAAGAGGA CAAAGAAAACGATGGAGACTCTTTCtCA 

MM MM liliMt 1 1 1 1 1 f 1 1 1 1 ] 1 1 i I i 1 f t III I Ml III M IMII I 

18 27 78 GGACaTGGTcCAGATTAgATTTGCACGAGGAAGAGGAagatgcAGAcAcCGAaGGAaACcCTTTCggA 

con GgacgTgGTccAgatTAgattTgcacGAggaaGAGGA c-agGAaaacgAtGGA — ca-T-tcc-a 

6 27 4 0 GCGTTTACATGCGTGCCAGGAaCAGTTGTTAGAACTTTATGAAGAAAACAGTAcTGAccTACACAAAC 

Mmmmiimmii iiimmimmimimmmi m mmm 

11 274 0 GCGTTTAGATGCGTGCCAGG AtCACTTGTTAGAACTTTATCAACAAAACACTAtTCATaTACACAAAC 

mm mi urn i i im urn mi i i i mi m n 

3 3 27 66 ACGTTTAAATGCagtgCAGGAgAAAATACTAGAtCTTTAcGAAGcTGATAaaACTGATtTACcatcAC 

miimm mil mmm i m m iiim n ii m i 

16 2 7 72 ACGTTTAAATGTGTGTCAGGACAAAATACTAacACATTATGAAAATGATAGTACAGAcCTACGTGAcC 

Mmmimiimmimim n mmmmiimm i n mi i 

31 2711 ACGTTTAAATGTGTGTCAGGACAAAATAtTAGAACArTATGAAAATGATAGTAAACgaCTttGTGAtC 

iimiM i milium mi n iimimimi mm i i 

18 2 8 4 6 ACGTTTAAgTtgcgtgCAGGACAAAATcaTAGAcCAcTATGAAAATGAcAGTAAAgacaTagacagcC 
con aCGTTTAaaTgcgtg-CAGGAcaAaaTatTAgaaC-tTAtGAA-atgA-AgtAc-gaccTacacaaaC 

6 2 8 08 AtgTatTGCATTGGAAATGCATgaGAcatGAAAGTGTATTAtTAtAtAAAGCAAAACAAATGGGCCTa 

i i miiiimiiiiiiii ii mmiimi ii i ii 1 1 ii i ii ii 1 1 m ii i m 

11 2 808 AcATTaTGCATTGGAAATGCATACGAtTGGAAAGTGTATTAcTAcAcAAAGCAAAACAAATGGGCCTg 

i iii mmm inn im m mi i i m immmimi i 

33 2 8 34 AaATTCAaCATTGGAAACtgATACGCaTGGAgTGTCCTTTATTgTAtAcAGCCAAACAAATGGGATTT 

i ii ii mmm n m i n mm mm mm i imimmmi 

16 284 0 ATATAGACTATTGGAAACAcATgCGCCTaCAATGTCCTaTtTatTAcAAgGCCAGAGAAATGGGATTT 

mimiimimm n m n mini i n n n mnmiiii i 

31 2 779 ATATAGACTATTGGAAACAtATtCGaCT tGAATGTG tAtTAaTgTATAAAGCAAGACAAATGGGAATA 
I III I III III III MM III 111 II I I I mm III II III 

18 2914 AaATAeAgTATTGGcAACtaATaCGttggGAAaaTGcAaTAtTcTtTgcAGCAAGgGAAcatGGcATA 

con AtaTagag-ATTGGaAAc-cATacGactgGAa-gTG-atTatt-tataaaGCaA-a-AAatgGG — Ta 
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6 287 6 AGCCACATaGGaaTgCAAGTACTgCCACCATTAAagGTGTCcGAagCaAAAGGACATAATGCcATTGA 

iiihin ii i niiiiii Minimi inn n i iiimiimm inn 

11 2876 AGCCACATcGGgtT»CAAGTAGTaCCACCATTAActGTGTCAGAGACtAAAGGACATAATGCtATTGA 
II I I II II II II I II I III Nil I I I I I I I I Mill 

33 2902 tcACATtTatGGCACCAgGTGGTGCCttCtTTGttaGcATCAAAGACcAAAGCATTTCAAGtAATTGA 

m i i mm niiiiii i ii i muni immi mi mm 

16 2908 aAACATATTAACCACCfcaGTGGTGCCAaCacTGgCtGTATCAAAGAatAAAGCATTACAAGCAATTGA 

i minium 1 1 ii 1 1 1 1 1 i ii i miimi urn imnn mil 

31 284 7 CAcAgTATTAACCACCAGGTGGTGCCAGCgTTGtCaGTATCAAAGgccAAAGCcTTACAAGCTATTGA 
II I I I M I I I I 1 I I I 1 II I I II I I I I Mill lllll IIIIMMM 

18 2982 CAgAcatTaAACCACCACGTGGTGCCAGCcTa'.aacaTtTCAAAaagtAAAGCacataAAGCTATTGA 



6 294 4 AATGCAAATGCATTTAGAATCaTTAttAAggACTgAGTATaGTaTGGAACCgTGGACATTACAaGAaA 

1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 III II III lllll II IMIMI I i 1 1 1 1 1 1 1 1 1 M I 

11 2944 AATGCAAATGCATTTAGAATCcTTAgcAAAAACTCAGTATgGTgTGGAACC tTGGACATTACAgGAcA 

I I IMMI Mill I Ml Ml I II II 1 1 II I I II Ml II II I I 

33 297 0 ACTaCAAATGgCaTTAGAgACATTAagTAAATCACAGTATAGTAcaagcCAaTGGACATTgCAAeAaA 

Ml III I I Mill III II III lllll MIIMI I Illlllll III I 

16 2976 ACTgCAACTaACGTTAGAAACAaTAt ATAACTCACAATATAGTAATGAaaAgTGGACATTaCAAgAcg 

ill iii i i iii mm ii inn i nn i mn i mm i n i 

31 2915 ACTaCAAATGAtGTTgGAAACAtTAaATAACACTgAATACAAAAATGAGGAcTGGACAaTGCAgcAa A 
III MUM III I Ml Illlllll lllll llllll llll I I 

18 30S0 ACTgCAAATGgcccTacAAggccTtgcacAaAgTcgATACAAAAccCACGAtTGGACAcTGCAagAcA 



6 3012 CaAGTTATGAAATGTGGCaAACACCACC tAAACGcTGtTTTAAAAAACgGGGeAAaACTGraGAaGT 

I iii illinium iimnn inn n imimimi mi ii mim m ii 

11 3012 CcAGTTATGAAATGTGGCTAACACCACC CAAACGgTCcTTTAAAAAACAGCGAAAtACTGTgGAgGT 
llll II llllll! Ill I I II MIIMMMI III I II II II 

33 3038 CaAGCTTaGAgGTGTGGCTt t gTG aACCACC AaaATGrTTTAAAAAACAaGGAgAaACAGTaactGT 
III I II llll I II llll I I llll I Illlllll III I lllll II 
16 3044 ttAGCCTTGAAGTGTATTTAACTGCACCAAC AG G AT G T aT AAAAAAAC ATGG ATAT AC AGT g G AaGT 

II I III I I I I III I I I I I III III II III III III III I II III I II I I I I M II II - 
31 29B3 CAAGtCTTGAACTGTATTTAACTGCACCTAC AGGgTGTTTAAAAAAACATGGATATACtGTAGAgGT 

II I lllll I II I llllll 1 f 1 I I I I I I Ml I II Ml I II 
18 3118 CAtGcgagGAACTaTggaatACaCaACCTACtcactgcTTTAAAAAA ggTCGccAaACaGTAcAaGT 

con caaG-t-tGAa-TgTggctaac-gcACCaacaa-g-tgttT-AAAAAacatGGa-A-AC-GTagaaGT 

6 3079 tAAATTTGA TGGCTGTGcAaACAATacAATGGAtTATGTGGTATGGACAgATgTgTAtgTGCAGG 

nimii immi i iiiii mm MMMMimm ii i n Mini 

11 3079 aAAATTTGA TGGCTCTCAAgACAATgtAATCGAgTATGTGGTATGCACAcATATATAccTGCAGG 
Ml III II I III llllll III III I lllll I I 

33 3105 GCAATaTGA caatGACAAAaAaAATACAATGGATTATACAAACTGGggtgAaATATATATTataG 
Ml I III llll I limilll llllllllllllll I I I I 11 I 1 t I I 

16 3111 GCAgTTTGATGG aGACATAtgCAATACAATGCATTATACAAACTGGAcAcATATATATATTTGTG 

Ml Illlllll || || III M llll IMIMI llll III I llllll I III 

31 3050 GCAaTTTGATGG tGAtCTAcACAAcACcATCCATTATACtAACTCCAaAtTTATATAccTaTCTa 

I Illlllll III I I II I II II I I 

18 3185 atAtrTTGATGGcaacaAaCacaAttgtAtgAcctATgtagCatgggacAgtgTgtatTAtaTgacTg 

con gcAartTGAtggcaacgatgaaaacaatacaAtggAttat-caaactggacagatataTAtaTg — tg 
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6 3144 ACAAtGACaCcTCGGTAAAgGTgcaTAGTatgGTAGATGCtAAGGGtATATATTAcACATGTGGACAA 

mi in i iiiiiin ii mi mum urn mum uiimmii 

11 314 4 ACAAcGACtCATCGGTAAAaCTaACTAGTtccCTACATGCeAAGGGcATATATTATACATCTGGACAA 

i i ii mi i ii ii i mm i ii u mini i i 

33 3170 AgGAACAtaCATCtACTATGGTtACaGGgaAAGTAC ATTATAtaGCTATgTATTATATACATaActGt 

I Mil II III llll II llll II III III I 1 I I I It Mil I I 

16 3176 AAGAAGcatCAgtaACTCTGGTaGAgGGtCAAGTTGAcTATtAtGGTtTaTATTATGTtCATGAAGGA 

ill i i urn ii ii ii mm n i i u i iiiimi mimn 

31 3115 tAGAtCGccaATGtACTGTtGTgGAaGGgCAAGTTAATTgTAAGGGcaTtTATTATGTACATGAAGGA 

i ii m i iiii i m i iiimm i um 

IS 3253 atGcaCGaacATGggacaaaaccGcfcacctgtaTaAgTcacAgCGCatTgTATTATGTAaAgGAAGGg 
con a-gaaGacaeatgg-cta-ggt-g-t-gt-aaGTagattataagGGtaTaTATTAt-tacatgaagga 
S 3212 TTTAAAACATATTATCTAAAcTTTgtaAAAGAGGCAgAAAACTATCGgAGcACCAAaCATTGGGAAGT 

1 1 1 1 1 1 1 1 1 n 1 1 1 1 m 1 1 1 iii mmm minim u um minimi 

11 3212 TTTAAAACATATTATGTAAATTTTAAtAAACAGGCAcAAAAGTATGGTAGtACCAAtCATTGGGAAGT 

u um i i imiiii i ii ii mini n n i mum 

33 3238 gaAAAggtATArTTTaaAtATTrTAAAGAgGATGCtGcAAAGTATtcTAAAACacAAaTglGGGAAGT 
I IIIIMI I MIIIMI Mill I III III IMII II I IIIIUII 

IS 3244 atAcgAACATATTTTGTgcAgTTTAAAGAtGATGCAGAAAAATATaGTAAAAaTAAAgTATGGGAAGT 

iiimumi i mm in ii mi iimiii i i mi niinui 

31 3183 cAtAfcAACATATTTTGTAaAtTTTAcAGAaGAgGCAaAAAAATATGGGActggTAAAAaATCGGAAGT 
II II I M I II I llll I II MIIMIIIIM I MIIIMI 

18 3321 tAcAacACgTtTTaTaTAgAaTTTAaAagtGAatgtgAAAAATATGGGAacacaggtAcgTGGGAAGT 

con fc-taaaacaTaTT-TgtaaAtTTTaaa-aaCAggcagaAAA-TATqg-Aa-ac-aaaaa-TGGGAAGT 



328C 



ATGTTATGGCAGCACAGTTAT 

I M 1 1 1 11 1 1 i 1 i 11 i 1 1 1 1 1 

ATGTTATGGCAGCACAGTTAT 
I i II I MM 
ACATGtGGGTGGTCAGGTAAT 

mi Minimum! 

tCATGCGGGTGGTCAGGTAAT 

1 11 1 1 1 1 1 M M 1 1 ! 1 1 M I 

gCATGCGGGTGGTCAGGTAATTG 
Ml II I I IIIMM 



ATCTTCTCCTGC 

MIMIIMMI 

ATGTTCTCCTCC 
II llll I 

tgrrrcTCCTAC 

I mum 



ATCTGTATCTAGCACTacACaAGAAGTAT 

I i 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1111)111 

ATCTGTATCTAGCACTgtACgAGAAGTAT 



III I 11 I I II II 
gTCTaTATCTAGCA 



I 



ACCA 
II I 

atTaTGTCCTACATCTGTgTTTAGCAGCA ACGA 

I I llll IIIMI I I 1 1 1 1 M MM 

TTTtTCCTgaATCTGTaTTTAGCAG TGACGA 

- i ii in nun 



33 3306 1 

16 3312 t 

31 3251 < 

18 3389 aCATtttGGgaaTaAtCTAATTGattgTaaTgactctatgtgcagTAcCAG TGACGAcacggTAT 



I ill 
AaTAT 
I III 
AgTAT 
I III 
AaTAT 
III 



6 3342 CCATTCCTGAA tCTACTACATACACCCCCGCACAGACC tCcaCCCT tGTGTCCtCaaGC 

inn um 1 1 1 1 1 1 1 1 1 1 u 1 1 e m 1 1 n 1 1 1 i i mi mm i m 

11 3342 CCATTgCTGAA CCTACTACATACACCCCCGCACAGACCacCgcCCCTacaGTGTCCgCctGC 

iii I mil i limn mm i i i 

3 3 336 2 CCACTaCTGAAACTgcTgACATACA gACAGAC 

I I II IIIMI II II II 

IS 33 71 CCTCTcCTGAAAtTATTaggcagCA cttgGCC 

III II III I III 
31 3310 CCT tTGCTggGATTGTTAcAaACCTACcaacaGCC 

m i ii i mm i mm 

18 3454 CC gcTaCTcaGcTTGTTAaAcAGCTAC 



AAC gaTaacCGACC AC 

III IMII II 

AACcACcCCgCCgCGACCcATAC 
III II II II I I Ml 
AACaACACCaCCaCAtCGaATTC 

; um u ii ii nu 

AgcACACCcCCtCAcCG'.ATTC 



— caacaac-ccccc-caacc-acac 
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6 3403 CA 
II 

H 3406 CAC 
III 

33 3410 CACAAGC 

II I I I I 

16 3426 CA AAGCCgtCGCCTTGCCCACC 
II II II lllllllllllll 



aCGAACACGcaGTCcaaaCGCCGCCTAGGAAaCG 
mum III lllllllllllll || 
CGAAGACCGcGTGtcggCGCCGCCTAGGAAgCG 
II I I 

agcggccAAACgacGACGAC cTgCAGacACCA 

mi nun i iii mi 

GAAG AAACacaGACGAC TATcCAGCGACCAAGA 

mi nun n in inn in 

31 336 8 CA AAACCtgCGCCTTGGGCACCaGtGAAGgtgrgCGGCgggcGACGACGTCTActaACCCACCAAGA 

II III II llllllll I III I llll llllll 

IB 3503 CA gcACCgtgtCCgTGGGCACC GcaAAGaccTaCGGC caGACGTC 

BE10-GGCGXGXCGGCGCCCCCXAG-BE10 

con CAcaaaccgtcgccttgggcacc-g-gaaggcgtacgaagac-gacgacgtcc-cc-ag — accaaca 

6 3439 AGCACGaggagtccaACaGTCCcCTtgCAACgCCtTGTGTGTGGCCcACATtgGAcCCGTGGACAGTg 

iiiiii ii iiii ii mi ii minimi mi n imiiiim 

11 3442 AG CAC G tggACCGTCCaCTaaCAACaCCcTGTGTGTCGCCaACATCaGAtCCG?GGACAGTA 
II I I III I I I II mm II 

33 3449 CAGACACCGCCcAgCCCCT tacaaAgcTGTTctGTGCA gaCccCgCCtTCGACAaTA 

lllllllll I lllll I I llll I I llllll I 

16 3481 tCACAGCCACACACCG GAAACCCCTgCCACaCCActAAGTTGTTGCAcaGaGACTCaGTGGACAGTG 

iimimi i iiiii ii mi m Milium i inn minim 

31 3435 aCAGAGCCAGAGcaC aGAAACiCCCaCCACcCCAacAAGTTCTTCCCAgGCCACTCcGTGGACAGTG 

i i ii ii ii i i i i i nil i mni n 

18 3548 GgctgctAcACgaCCtggAC&CtgtggAcTcGcgGaGAaGCagCattGTGGAC cTG 

con acaaagccaga— ccgc-aaaCccct-c-acaccatgt-tttggtgcacagcggctccgTGCACagTg 



6 3507 g AAaCc ACAAC cTC aTCACTaAC 
II I lllll II lllll II 
11 3504 cAAtCaACAACATCGTCACTgAC 
II I I III II 

33 3506 gAACAgcaCgtACTGCAACTAACtGC 
II I llllll llll II 
16 354 8 cTCCAaTcCtcACTGCAttTAACAGCT 
llll I lllll 

31 3502 TCAACTgtggggTTaTCaGTGCAGCT gcatgCACAAAccAAACAA 

iiiii ii ii imiiii nm 

18 3604 TCAAC ccacTTcTCgGTGCAGCTacacctacaggcaacAACAAaagacGCaaacTCtGTaGTgg 



AATcACgACcAGCACCAAA 
III II II lllllllll 
AATtACAACAAGCACCAAA 
llllllll I I 
aCAAACAAGCAgCGGA 

ii iiii mi 

CACACAAAggACGGA 
I 



GACGG AACAACaG 
II II llllll I 

GAaGG AACAACTG 

I I I 

cTgtGTGT AgTTc 

I III II 
tTaaCTGT AaTag 

mi i 



c — aac-c — cactgc-actaaCagctaat-c-aacaagcacca-Aa— 



— gggtgtcaaca-t-g 



6 3562 TaACAGTtCAGCTACGCCTATAGTGCAAtTtCAAGGTGAaTCCAATTGTTTAAAgTGTTTTAGATATA 

i iiiii imimummmi t imiiii nmimmn mnmnm 

11 355 9 TcACAGTGCAGCTACGCCTATAGTGCAAcTgCAAGGTGAtTCCAATTGTTTAAAATGTTTTAGATATA 

i ii mi minimi i mini n m mmnmn nm 

33 3561 TAACgtTGCA CCTATAGTGCATTTAAAAGGTGAaTCaAATAGTTTAAAATGTTTAAGATA 

mi i ii ii nm nmmmm i im mmnmimm 

IS 36 0 3 TAACACTACA CCcATAGTACATTTAAAAGGTCATGCt AATAcTTTAAAATGTTTAAGATA 

i mm ii in mi imnimmi mi unnmimmi 

31 35 6 3 TgcaACTACA CCTATAATACAcTTAAAAGGTGAXGCAAATAtaTTAAAATGTTTAAGATA 

i mn minimi minimi in i iiinimm i n 

gCCTATAATACAtTTAAAAGGTG Ac agAAAcAg t TTAAAATGTTTAcGgTA 
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6 3630 GgCTaAATGACAgAcAcAqACATTTaTTTGAtTTAatATCaTCAACGTGCCAcTCGGCCTCctCaaAG 

i n nun i i i huh inn in m milium mum i n 

11 3627 GaCT^AATGACAaATATAAACATTTCTTTGAiTTAqcATCt.TCAACGTGGCA'rrGGGCCTCacCtgAG 

m mm i mi i i n n n iiimm n i 

33 3621 cAGATTaAAAcctTATAAAgAgTTGTATAgPtCtaTGTCaTCcACcTGGCATTGGACCaGtgAcAAc 

urn iii ii imm i i mi n n imimm i i n 

16 3663 TACATTtAAAAAgcATtgtacATTGTATAcTgCAGTGTCgTCTACATCGCATTCGACAgGacAtAAT 

iii i m ii mini mm iiiimmiiiiim i n 

31 3623 TAGgcTGtcAAAAtATAaacAATTGTATgaAcAAGTGTCATCTACATCGCATTGGACAtGtecagAT 

ii ii mi m i iii i i i 1 1 1 1 1 ii minium 

18 3728 cAGatTGcgAAAAcATAgcgAccacTATagAgAtaTaTCATCcACcTGGCATTCGACA ggtgc 
:a-ttgTaT-a-t-a-t-TCaTC-AC-TCGCAtTGGaC-tg-cc-aa- 

6 3698 GCACCACATAAA cATGC cATTGTAAC t gTAACAT ATgatAGTGACGAACAAaGgCAaCAgTTTT 

mmmm mi mum nnn n mmiiim i n n mi 

11 369S GCACCACATAAAA ATG C AATTGT AAC a tTAACA? ATAgcAGTGAGGAACAACGtCAgCAaTTTT 

i inn m iiimm iiiin i n nmi i mi 

33 3688 aaAAAtagTAAAA ATGgAATTCTAACcgTAACATtTGtaAcTGAAcAGCAACAAC AaatgTTTT 

m inn ii mm n i mi n i mi inn i mi 

16 3730 GtAAAACATAAAA gTGCAATTGTtACacTtACATATGatAGTCAATgGCAAC GtGACcAaTTTT 

I minimi in nm n i mm m u in i in i mi 

31 369 0 GGAAAACATAAAAA TGC tATTGTaACc tTAACATATat aAGTac ATCACAA AGAGACgAtTTTT 

i ii nm i ii i ii imm iii i nm m i mi 

18 3791 aGgcAAtgaAAAAAcaGgaATacTgACt gTAACAT Ac catAGTg a AaCACAA AGAacaaAaTTTT 
con g-a-aacatAAAaaatGcaATtgTaACtgTaACATatgatagt-aa-agcAAcaaag — aacaaTTTT 

6 37 62 TAgAtgtTGTAAAAATACCcCCtACCATTAGcCA CAAacTGGGaTTTATGTCAcTgCAccTATTGTA 

ii i i j 1 1 1 1 1 1 ( i f i if iiiimi ii ii mi immn i n mini 

11 3759 TAaAcAgTGTAAAAATACCACCcACCATTAGgCAT AAGgTGGGgTTTATGTCAtTaCATTTATTGTA 

ii i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 i i ii m ii mm 

3 3 37 52 TAggTAccCTAAAAATACCACC tACTgTgcaAAT AAG TACTGGATTTAT 
I I II lllllllll III I II I ) I II I I I I I I I 

16 3794 TgtcTcaaCTtAAAATACCA AAaACTaTtaCAGT GTC TACTGGATTTAT 

i i ii iiiimi ii n i im iii nm 

31 3754 TAAATACTCTaAAAATACC t AAc AC a gTa tC ACT GTCaacaggatatatgactATTTA 

imimii 

18 3856 TAAATACTGT 

con TaaatactGTaaaaataccaccaaaca-tagcaat-aaggtcgg-tttatgt-actg-atttattgta 
6 3929 AtttgtatatatgtaaAtgtgTaaATATATGgTATt9CTGTAatacaActgTACaTGTATGGAaGTgG 

i i i mini m nm i in mum n i 

11 382 6 A cCATTAcaccTgtATATATG TATAtGTGTA CATAACATACgTGTATGGAgGTaG 

lllll It lllll III III I III! 

33 3801 G aCATTA TAaGTGTA CATcACAagCcaaTATG 

I 

16 3843 G tCt 



tgcaattccagatagtgtacaaatattggtgggataCa 
— catta t — atatatggtatatgtgta — cataacaaacatgtatggaagccg 



ss 
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6 3897 TGCCTGTACAAATaGCTGCAGgAACAACcAgcACATTcATAeT GCCTGTTaTaATTGCAT 
lllllllllllll lllllll llllll I lllll III I lllllll I lllllll 

IX 388 X TGCCTGTACAAATTGCTGCAGcAACAACT AC AAC ATTGATATT CCCTGTTC XTATTGCAT 

llllll I II III lllllll I Mil I I 

33 3833 TGCTGC TaAtTgtATAtAACcATGATATTtgTTTTTG taTTaTCTTTTaTaTT 

I llllll I llllll II 111 I I 

16 3847 atATGA caAArcTtgatacTqcATccAcAaCATTAcTGgcgTGCTTrTTG CTTTGCTTT GTGTG 

mi ii iii mi n niiinii iiiinin inn 

31 3815 TaATGATtgAActaAatattTcTAcagtaAgCATT gTGCtaTGCTTTTTG CTTTGCTTTTGTGTG 

it n n i in i it ii inn iiiiiiiiiiiii 

18 39 04 TgAcaATgtAAtacAtatgcTgTAgtaccAatATgttatCacTtaTTTTTttaeTTTGCTTTTGTGT 



6 3957 TTGttGTATGTtrrgTTAGcATcaTACrrAltgTATggATATCTGAgTTTaTtGTgTAcACATCTGTG 

in nun ii mi n imm m imnn in i n n iiinnii 

11 3941 TTGcaGTATGTaTTcTTACtATtgTACTTATaaTATTAATATCTGAtTTTgTaGTaTATACATCTGTG 

iii i i ni mm I II III I 1 II 

33 3886 gT TTTtaTGctTaTcCtTATTAtTACGTCCtTTAATAcTTTCcaTTTCTACcTATgCtTggTTG 

I III II I I I lllll lllllll I I I II I lllll II I I II 

16 3911 CT TTT GTGTGTCTgCcTATTAATACGTCCgCTgcTttTGTCTGTGTCTACATAcaCATCATTA 

ii m 1 1 1 ii ii i i 1 1 f 1 1 1 1 1 ii i mimm III II II II 

31 3880 CTactaTTT GTGTGTCT tgTcATACGTCCaCTtgTgcTGTCTGTGTCggtATAtgCAaCAcTA 



6 4025 CTaGTACTAACACTgCTTTTATATTTaCTaTTGTGGCTgcTATTAACAACCCCcTT GCAATTtTTcc 

ii mimim minimi n iiiniii iiimmmi n mm n 

11 4009 CTGGTACTAACACTTCTTTTATATTTGCTTrrGTGGCTttTATTAACAACCCCTTT GCAATTcTTTT 

mil i i i i i imm i i iiiii mi nn 

33 39S0 CTGGT gTTGGTATTgcTgcTtTggGtgTTTGTGG gAtCtCCTTToaaAATT TTTT 

I I mum ill ii 

16 3974 aTAaT ATTGGTATT acTaTTgTGGaTAACaGCAgCCTCTgCgTTTaG 

II I III I I I II III I I III lllll I II I 
31 39 4 3 cTAtT ATTaaTtgT gaTtTTaTGGgTtAttGCAaCCTCTcCaTTacG 



30 



EP 0 477 972 B1 



6 4 092 TACTAACtCTactTGTGTGtTAcTgTCCcCCaTTgTATATACACtacTAtATTGT taccacA 

mi ii i ii nun ii i iii ii ii i mi ii 1 1 ii inn i 

11 4 0 76 TACTAACaCTgtgTGTCTGCTATTTTCCTGCcTTTTATATACACataTACATTGTgcaaacgcAAcaA 

i mum i immi n ii n 

33 4005 T tTGCTATTT GTTdtTTTTATAT tTA CCAaTGaTgTgtATtAATtt 

i mil in mi n m i ii ii n 

16 4021 gTGTTTtaTTGTATATATtaTATTT GTT TATA TA CCATTATTTTTAATaCATAC 

urn iiiimm i i m mi n mmm im urn 

31 3990 tTGTTT TTGTATATAT gtTgT GTT TATA TAtattCCATTATTTgTAATtCATAC 



6 4154 cAgcaaTGATGcTAACaTGTCAaTTtAATGATGGaGAT ACcTGGcTGggTtTGTCGTTGTTatgTG 

i mil im inn u mum in n in n i iiinuni n 

11 4144 TAATGgTGATGtTAACcTGTCACTTaAATGATGGtGAT ACATCGtTCTtTcTGrCGTTGTTtacTG 
I III I II II I I III II lllll III I II I II 

33 4051 TCATGCACAgcaTAtgacacaACaagAgTAATGTATAT ACATGtaTaTATTGTTtGTATATAtgTG 

umn i minim mi mm urn 

16 4075 ACATGCAC gcTTTTTAAttacaTAATCTATATGTACATAaTgtaATTGTT ACATATA 

mini mini n i i i u u n 

31 4044 ACATGCA tctTTTTTAA GTCAAcAgTaaCTTrTTT AC 

I I II lllll I 

18 3971 AtgcatgtatgtgtgctCcCAtgtcccgCTTTTgccAtctgtctgta 



220 CcTTTaTTGTAGggaTgtTgGGgTTaTTaTT gaTgCAcTAtAGaGCTGTACAaGGggaTaAAc 

i m mm i i u ii u u i n u n mum n i n 

210 CATTTgTTGTAGcTGTacTfcGGATTG TTGTT acTaCATTAcAGgCCTGTACATCGTacTgAAA 
II II I III I I llll II II I II I III I 

.117 CA caTGGTGgTGTtTTAacATTGTTGTT gTTATTTT AgtTTTTTTTTTTTTGTA 

I II II I II lllllll I I I M I I I I I I I II 

,132 atTgTTGTATACcaTaActtactaTtTTttctTtTTTATTTTcaTatAtaaTTTTTTTTTTTGT 

i mini in im i i ii i i 

,081 TtGTGTATAC TgTTgtTTgTatTgCTattggTaTTggTaTTgg 

llll ll ll I 

,018 tgtgtgcGTaTgcAtgggtattggtatttgtgtatatTgtggTaataacGTcccctgccacagcaTtc 



6 4283 AcACcAAATGTaagAAGTGTAA CAAAC aCAActgTAaTGatGATTATGTaacTATGcattATacT 

i ii mm mum urn in u u mum mi u i 

11 4273 AaACTAAATGTgctAAGTGTAAATCAAACcgCAAtacTAcTGTgGATTATGTGTATATGtcacATggT 

inn mil i iii mm 

33 4171 ttACTAA TAAAT AccTTTATaTtt.tagcaGTGTA? 

I till 
16 4196 TTgTTTgiTTgTTTTTTA 

ii iii ii mini 

31 4124 tattggTATtggTaraaTaaacTTTTTTaCTTtrTTTTTA 

III I I I I II I I II I 

18 4 08 6 acagtaTATgtaTtrtgTttttTaTTgccCaTgTTacTattgcatatacatgctatattgtctttaca 

con a-actaaatgtattaagtgtaatt-t — cc-t — tttT-atgttgattaagtgtatatg tataot 

021 -ACACCAAATGTAAGAAGTGTAA CAAAC ACAACTGTAATGATGATTATGTAACTATGCATTATACT-02 1 
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6 4348 acTgATCGtGATTAT aTaTAtATGAAtTAGACTAAACCgTTTTTTATAtttgtaacaGTCTAtGc 

i mi nun i ii him minimi iinmii mil i 

11 4341 gaTaATGGaGATTATG TgTAcATGAAeTAGAGTAAACC TTTTTTATAcagtgtgtgGTCTAcGt 

mm 

33 4206 fcATTATG 

16 4214 ataaactgTTATTA 

mm 

31 4164 TTATTA 

I I 

18 4154 gtaattgtataggtlgttttatacagtgtattgtacattgtatattttgttttataccttttaTgcTt 



6 4413 TttgTATAccATggcacAtagTAGGGCcCGacGACGcAAgCGTGCG-CAGCtACACAgCTATATCAAA 

i mi ii i mm ii mi n miimm nm iiiiiiiiii 

11 4405 TagtTATA tATAatgAAACcTAGGGCACGcaGACGtAAaCGrGCGTCAGCcACACAACTATATCAAA 

i i mi ii i i i nm ii ii 1 1 1 1 1 1 1 1 1 1 1 mi 

33 4213 AGACACAAACGaTCTaCAAGgCCCA AgCGTGCATCtGCaACACAACTATAcCAAA 

mimiii iii m mi i miimm ii ii nm n in 

16 4228 CttaacaATGCGACACAAACGtTCTgCAAAACGCACaAAACGTGCATCgGCTACcCAACTtTATaAAA 

i urn mm in Milium muni n nm m i m m 

31 4170 C cATGCGgtcCAAACGcTCTaCAAAACGCACTAAACGTGCgTCtGCTACaCAATTaTATcAAA 

I I III I I I I I I 

18 4222 tttgtattTttGtaatAAAaGtatggtAtccCaCcgTgccgcacgacgcaaacgggctTcggtaactg 



6 44 81 CATGtAAactCACTGGaACATGCCCCCCAGATGTAATTCCTAAgGTgGAgCACAacACCATTGCAGAT 

mi ii mm nm immiiiiimmu u u u i n nmim 

11 4472 CATGCAAGGCCACTGGtACATGtCCCCCAGATGTAATTCCTAAAGTtGAACAtAcTACtATTGCAGAT 

miiiiiiMM ii ii ii u ii mil milium m i m u mm 

33 42 6 8 CATGCAAGGCCACAGCcACcTCcCCACCcGATGTTATTCCTAAAGTgCAAGGaAgTACcATaGCAGAT 

muii mi m n nm m nn nm n nm i ii n n n 

16 4296 CATGCAAAcagGCAGGTACaTCTCCACCtGACaTTATACCTAAgGTtGAAGGcAaaACtATTGCtGAa 

mi iii iiiinii mm i hi uiuiiui i in i ii urn ii 

31 42 33 CATGtAAAgcAGCAGGTACtTGTCCAtCaGACgTTATACCTAAaaTaGAACaTActACcATTGCaGAC 
I I I II III I i llll I I I I II I I 

18 4290 acTtatAtaaAaCAtGTAaacaatCtggtacatgTccACCTgAtgTtGttCcTAaggtggagGgcacC 
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6 4549 CAAATATTAAAATGGGGAAGtrrgGGGGTgTTrrTTGGaCGGTTgGGTATaCCeACgGGttCeCGcAc 

iiiiiiimiiiiiniii ii mil mum mil inn u n n mm 

11 4540 CAAATATTAAAATGGGGAAGcTTAGGGGrmmTGGTGGGTTAGGTArrGGtACAGGggCTGGTAg 

urn i mi ii ii iiimiimiiimm mimim mn iiiii 

33 4336 CAAATtcTtAAATATCGcAGTTTAGGGGTTTTTTTTGGTGGtTTACGTATTGGcACAGGcTCTGGTtC 

mn i iinm iii i ii ii milium mn nm um u m i 

16 4364 CAAATATTAcAATATCGaAGTATGCGTGTaTTTTTTGCTCGGTTACGaATTCGaACAGGgTCgGGTAC 

iiimiii nm milium miiiiiiimi u nm i n n iiiii 

31 4301 CAAATATTAaggTATGGtAGTATGGGTCTtrrTTTTGGTGGGTTgGGtATTGGgtCcGGCTCtGGTAC 

ii i i i um ii m m um 

18 4358 acgtTAgcAgataAaatattgcaatGgtcaagccTTGCTataTTtttgggTGGacttGGCataGGTAC 



6 4617 TGGgGGTCGTaCtGGcTATgTtCCCTTacaAActTCTgCaAAaCCTtCTATTACTaGtGGGCCtatgG 

iii mm i n m i nm n in i ii in u i nm i 

11 4608 TGGcGGTCGTgCaGGgTATaTACCCTTgGGAA^cTCTCCcAAgCCTGCTATTACTgGgGGGCCagCaG 

II II I I II III llll I II I II I III I II I 

33 4404 AGGtGGAaGgACTGGcTATgTACCtaTtGCtACtgacCCaCCtACACCTgCAAtccCctTCCagCCTa 

in m i iiiii iii i u i ii ii ii ii mm u i i iii 

16 4432 AGGcCGACGCACTGGgTATaTtCCatTgCGaACAaGgCCTCCcACAGCTaCAGAtaCAcTTgetCCTg 

ii n ilium iii i ii i i in i iii i mi i ii ii i iii 

31 4369 TGGgGGTCGCACTGGaTATgTcCCteTtaGtACACGtCCTtCTACAGtaTCtGAggCAagTaTaCCTa 

III II I II II I I I I I II I II I I 

18 44 26 TCGcaGTgCtACaGGgggTcgtaCagggtaeAttCcattgggTgggcgtTCcaAt.aCAgtggTggaTg 

con tGGcgGtcGtaCtGCgtaTgtteC-ttgggaAct-ctcc — ctacagctactaatacag-gcc-cctg 

BEll-GAAGCXCXCCCAAGCCXGCXAXX-BEll 
BE12-TATATACCCTTGGGAACCXCXCCCAAGCCXGCXAX-BE12 
Q2 l-TGGGGGTCGTACTGGCTATGTTCCCTTACAAACTTCTGCAAAACCTTCTATTACTAGTGGGCCTATGG-C 

6 4685 CtCGTCCtCCtGTGgTgGTGGAGCCTGTgGCCCCTTCgCATCCaTCtATTCTGTCtTTAATTCAaGAa 

i um ii in i milium mum um it mum mum u 

11 4676 CACGTCCgCCaGTGcTTGTGGAGCCTGTTGCCCCTTCcGATCCCTCCATTGrGTCcTTAATTGAgGAg 

mm u u in u mm uu u i u u um mn u u 

33 4472 TACGTCCtCCgGTtAClCTAGAcaCTGTTGGaCCTTtaGActCgTCTATAGTCTCaTTAATaGAAGAA 

u inn i u um uu u uu u i mum u m i mm 

16 45 0 0 TAAGACCcCCttTaACaGTAGAtCCTGTgGGcCCTTctGAtCCtTCTATAGTtTCTTTAGTgGAAGAA 

i mu u ii i u inn u ii i u u mum i i n mm 

31 44 37 TTAGACCaCCAgTTAgCaTtGAcCCTGTaGGtCCcTTGGACCCCTCTATACTAagTcTTGTTGAAGAA 

iiiii u i i i mi ii in mil i i iiiii i i 

IB 44 94 TTgGtCCtaCAcgTccCccaCtggtTaTtGaaCCtgTGGgCCCCaCagacccAtcTaTTGTTacAttA 

con t-cGtCCtcCagttac-gtaCagccTgTtGgcCCtt-gGa-cCctCtatagtgtcttTa-Ttgaagaa 
BE26-CGXCCXCCGGXXACXGXAGAXA-BE26 JJ2 2-TCTATTGTGTCNTTAATNCAAGAA 

BE2 7-GXCCXCCGGXXACXGXAGACACX-BE2 7 02 2— CGATCCATCTATTGTGTCTTTAATTGAAGAA 
BE28-XCCXCCGGXXACXGXAGACACXGXXGCACCXXXAG-BE2 8 
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47S3 TCgGCaATcATTAAcGCaGGgGCgCC TGAaaTtGTgCCCCC TgCACACGGTGGgTTTAC 

ii n ii inn it ii ii ii iii i ii inn i mi inn urn 

4744 TCtGCTATTATTAAtGCtGGTGCACC TGAggTgGTaCCCCC TACACAgGGTGGcTTTAC 

i i mi i ii ilium i ii urn ii m 

I 4540 ACaAGTTTTATaGAgGCaGGTGCACCA GccCCaTCtATTCC TACACcATCAGGtTTTga 

n iiiiiiii it ii mmm i nil iiiii mm m 

i 4568 ACTAGTTTTATTGATGCTGGTGCACCAaCatCTGtaCCtTCcATTCCcCCagatgtATCAGGaTTTag 

ii i ii mm mm in i it h in n in 

l 4505 tCTCCaaTTgTTCATGTTCCTCC CCCTGCTCCTAtaCCacacCCTCCTacaACATCTGGGTTTCA 

i iii i i i iii mi ii i n n inn mm 

l 4S82 ataGaggactccagTGTgCtTaCatcaggTCCaCCTAggCCtacgttTaCTggcACgTCTGCGTTTGA 

con -ctggta-ttatt-atGctGgtgCacca-ctgctgc-at — c — cccctcct-caccatctGGgTTT-a 

TCTAGTNTTATTAATGCAGGTCCACC - J J2 2 

BE 5-CAXXAACGCAGGGGCGCC XGAA-BE 5 

BE 6-GGCAAXCAXXAACGCACGCCCC-BE 6 

BE7-GCAAXCAXXAACGCAGGGGCGCC XGAAAXXGXGCC-BE7 

05-CTACCCCC TACACAGGGTGGCTTTAC 
022-TCGGCAATCATTAACGCAGGGGCGCC TGAAATTGTGCCCCC TGCACACGGTGGGTTTAC-022 

6 4812 aATtACATCcTCTCAAaCaACTACeCCTCCaATaTTgGATCT aTCaGTT AC tAgTCACACTA 

ii nm mm i iiiii mn n n nut n m n i iiiiiiii 

11 4 8 03 TATaACATCATCTGAAtCgACTACACCTGCtATTTTaGATGT gTCTGTT ACcAaTCACACTA 

i i ii iniii i iiiiiiiiiii iii i nn mm i i 

33 4599 TgTTACTACATCTGCAGATACTACACCTGCaATTATTAATGTttcaTCTGTTggggAgtcatCTATTc 

i until ii i nm iiiiiiii nm i i i n n 

16 4636 T ATTACT AC fcTCa aC t G ATACcAC AC C T G C tATATTA GATattaAtAatACTGTTA 

m mi i i n ii iiiiiiii ii iii m i i nm 

31 4570 cATTgCTACaaCtGCaGAcACaACACCTGC aATTTTA GAT gtaACaAgTGTTA 

II I I I I II IIIIIIII III I II 

18 46 30 tATaaCatCtgCgGgtacaACtACACCTGCggtTTTggatatcacaccttcgtctocctCtgtgtcTA 

con taTtaC — Catctgcag — ACtACaCCTGCiatt'tTt-atg't — catctgtt — tao-act — ta-Ta 

05-TATAACATCATCTGAATCGACTACACCTGCTATTTTAGATGT GTCTGTT ACCAATCACACTA— O 5 
022 -AATTACATCCTCTGAAACAACTACCCCTGCAATATTGGATGT ATCAGTT ACT AGTCAC ACTA-O 2 2 

6 4874 CtACTA GTaTaTTTagAAATCCtgTcTTTACAGAACCtTCTGTAAcACAaCCCCAACCACCcGTG 

i nn ii i m mm i mmimi niim m iiiiiiiiiii m 

11 4865 CcACTA GTgTgTTTcaAAATCCCcTgTTTACAGAACCgTCTGTAATACAgCCCCAACCACCTGTG 

mi i i mini nm nm mm nn n i i i mi 

33 4 667 aaACTATTtCTACACATttAAATCCCACaTTTACTGAACCATCTGTAcTACAcCCTCCAgCgCCTGCA 

m ii mum iiiiiin n nm mmm i n mm i mm 

16 4692 ctACTgTTaCTACACATaAtAATCCCACTTTcACTGAcCCATCTGTATTGCACCCTCCAACACCTGCA 

mm i nm nm nm mnmimmiim nmnn 

31 4623 gCACACATgAaAATCCTACTTTTACTGATCCATCTGTATTGCAGCCTCCtACACCTGCA 

n i mm i in mini him m i in 

18 4 698 tttccacaacCAatttTaccAATCCTgCaTTTtCTCATCCgTCcartaTtgAagtTCCacaAaCTCgg 

can ctactatta-taca— TaaaAATCC-ic-TTtaCtGAaCCaTCtgTaatacAgcctCcaccacCtGc- 

05-CCACTA CTGTGTTTCAAAATCCCCTGTTTACAGAACCGTCTGTAATACAGCCCCAACCACCTGTG-05 
022-CTACTA GTATATTTAGAAATCCTGTCTTTACAGAACCTTCTGTAACACAACCCCAACCACCCGTG-02 2 

027-CTGCA 
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6 4939 GAGGCt^aTGCaCAtATAtTaATtTCTGCaCCcACtgTAACgTCaCAcCcTaTAGAgGAaATTCCttT 

mil i iii ii iii i it inn ii ii mi n n i i im n dim i 

11 4930 GAGGCCAgTGGtCAcATAcTtATaTCTGCCCCaACaaTAACaTCCCAACaTgTAGAAGACATTCCAcT 

ii in m 1 1 i i i iii mi ii i i mii i m mi Ml I 

33 473S GAAGCCtcTGGaCArrTTATAcTTTCrrCCCCtACTgTTAGcACACAAAgTTAtGAAaACATaCCAAT 

iii i ii limn i iii! ii i iii mi inn i until i ii ii ii 

16 4760 GAAACtggAGGgCATTTTAcACTTTCATCATCcACTATTAGtACACATAATTATGAAGAAATtCCTAT 

urn m iiiii minium mini iniiimnm nm inn 

31 4682 GAAACatCAGCTCATTTAcTACTTTCATC ATCatCTATTAGcACACATAATTATGACGAAATACCTAT 

ii iiiii iiiii it ill ii linn minimum i 

18 4766 GAggtggCAGGTaATgTAtTtgTTggtaCecCtaCatcTgGaACACATgggTATGACGAAATACCTtT 

con GA-gc — c-GGtcAttTa-ta-TttcttC-cC-aCtattaq-aCaCAtaat-tafcGA-gAaAT-CCtaT 

05-GAGGCCAGTGGTCACATACTTATATCTGCCCCAACAATAACATCCCAACATGTACAAGACATTCCACT-05 
022-GAGGCTAATGGACATATATTAATTTCTGCACCCACTGTAACGTCACACCCTATAGAGGAAATTCCrM-022 
027 -GAAGCC TCTCG AC ATTTTATATTTTCTTCC CCTACTG TTACC ACAC AAAGTTATG AAAACATAC CAAT- O 2 7 

6 5007 AGAtACTTTTGTgGTATCaTCTAGTGATAGcCGtCCTACATCCAGTACcCCTgTTCCTgGTaCTgcaC 

iii mum nm milium n niinuiniu in nm n u i 

11 4998 AGAcACTTTTGTTGTATCCTCTAGTCATAGTGGaCCTACA'XCCAGTACtCCTcTTCCTcGTgCTtttC 
II II IIUIIII III I llll IIIII II II II llll I II I 

33 4 80 3 GGATACcTTTGTTGTTTCCACAgACagTAGTAataTAACATCaAGCACgCCCATTCCAGGGTCTCGCC 

nun in nm iiii ii ii i iiiii nm inn imiunmi 

16 4 828 GGATACATTTATTGTTagCACAAACccTAAcAcaGTAACtAGTAGCACACCCATaCCAGGGTCTCGCC 

niiumuiini ii ii in im niniiumii nun im 

31 4750 GG AT AC ATTTATTGTTTCTACTAAt aaTGAaAAcaTAACa AGTACCAC ACCCATtCCAGGGGTGCGC C 

i mm i mi n n i i mn n n i n nm i 

18 4 834 acAaACATTTgcTtcTTCTggTAcggggGAggAacccAttAGTAGtACcCCatTgCCtactGTGCGgC 
con -gAtACaTTTgttgtttccactaatgata aac-aAca — tAG-AC-CCcaTtCC-gg-gctcgcC 

OS-AGACACTTTrGTTGTATCCTCTAGTGATACTGGACCTACATCCAGTACTCCTCTTCCTCGTGCTrrTC-OS 
02 2-AGATACTTTTGTGGTATCATCTAGTGATAGCGGTCCTACATCCAGTACCCCTGTTCCTGGTACTGCAC-O2 2 
027-CCATACCTTTGTTGTTTCCACACACACTAGTAATGTAACATCAAGCACCCCCATTCCAGGGTCTCGCC-O2 7 

6 50 75 CTCGGCCTCGtGTCGGCcTaTATAGTCGTGCaTTgCAcCAGGTgCAGGTTACaGACCCtGCaTTTcTt 

iiniiiiii iiiii i minimi u u mn ilium nm u in i 

11 5 0 6 6 CTCGGCCTCGgGTGGGTTTgTATAGTCGTGCcTTaCAgCAGGTACAGGTTACgGACCCcGCgTTTTTg 

u i i ii i mn mum i mum i i 

3 3 4 871 CTGTGGCACGCCTtGGTTTATATAGTCG CAAtACcCAACAGGTTA AGGTTGTtGACCCTGC 

i iimumi ii iiiuuim it it minim i mn mum 

16 4896 CaGTGGCACGCCTAGGaTTATATAGTCG CACaACACAACAGGTTA AAGTTGTaG ACC CTGC 

IIIII llll IIIIIIIU I IIIUIII II I IIIII I II II I 

31 4818 GTcctGCACGTtTAGGgTTATATAGT AaGGC t ACACAACAAGTaA AAGTTaTTGAtCCaaC 

ii ill ii iiii iii mini m i i ii 

18 4902 GTgtaGCAgGTccccGccTtTAcAGT AgGGCctacCAACAAGT gtcAGTggcTaAcCCtga 

con ct-tqgCacGtct-gG-tTaTAtAGTcgtgc-atg a— caaCAgCT^aca-gttgttga-cctgc 

OS-CTCGGCCTCGGGTGGGTTTGTATAGTCGTGCCXTACAGCAGGTACAGGTTACGGACCCCGCGTTTTTG-05 
022-CTCGCCCTCGTGTGGGCCTATATAGTCGTGCATTGCACCAGGTGCAGGTTACAGACCCTGCATTTCTT-O2 2 
02 7 -CTGTGGCACGCCTTGGTTTATATAGTCG CAATACCCAACAGGTTA AGGTTGTTGACCCTGC-02 7 
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6 S143 TCCACtCCtCAaCGcTTaaTtACaTAT GAtAACCCTGTaTATGAA GGgGAgGATG 

inn n ii n ii i ii iii n iiiinii iniii n n mi 

11 S134 TCCACgCCaCAgCGaTTggTAACTTAT CACAACCCTGTcTATGAA GGaGAaGATG 

. i i mini n n mi i mi i n i 

33 4932 TTTTtTAACatCgCCTcaTAAACTTATaACATATOATAATCCTGCATtTGAAaGctTtGAccctGAaG 

mi mi i ii ii mm i i m 1 1 1 1 1 ti ii 1 1 1 1 1 1 1 mi i i n n 

16 4957 TTTTgTAACcaCTCCcAcTAAACTTATTACATATGATAATCCTCCATATGAAgGTaTAGATCtgGAta 

iii i i mi i i ii minimi n mn mm i n in n 

31 487 9 GTTTCTTAgtgCTCCAAaacAgCTAATTACATATGAaAACCCTCCcTATCAAacTgTAaATGCtGAaG 

in mi mi immmm nm n i 

18 4963 GTTTCTTAcacgTCCAtcctcttTAATTACATATGAcAACCC gGCCtttG 

con ttttct-accacteefcfca-taacttATtacatatGAtAAeCCtgcatatgaaagt-taga-gc-gatg 

OS-TCCACGCCACAGCCATTCGTAACTTAT GACAACCCTGTCTATGAA GGAGAAGATG- 05 

02 2 -TCCACTCCTCAACGCTTAATTACATAT GATAACCCTGTATATGAA GGGGAGGATG-022 

027-TTTTTTAACATCGCCTCATAAACTTATAACATArGATAATCCTGCATTTGAAAGCTTTGACCCTGAAG-02 7 



6 5198 

11 5189 

33 5000 

16 5025 

31 4947 



TfcAGTgTACAATTTAgtCATGAtTCTA 

i m niimii nm mi 

TaAGTTTACAATTTAccCATGAgTCTA 
I I M I I I II I III II 
AcACATTACAATTTcaaCATAgTGATA 

.'.HULL! n nm 



T aC AC AATG C AC CTGATG AgGC t TTTATG CAcATa 
I 1 1 1 i I I I i I I I i M I I I II lllllill II 
rcCACAATGCACCTGATGAaGCaTTTATGGATATT 

i iii nun i m i mm 

TatcaccTGCTCCTGATCCTGACTTTcTaGATATT 

nm 1 1 n 1 1 1 1 1 1 1 1 i nm 



AtACATTATAtTTTTCtagTAaTGATAatagtaTTAATATAGCTCCaGATCCTGACTTTtTgGATATa 

i i mn mn iii mimii n mn mm i nm 

AatCTTTATACTTTTC caATAcatCgC aTAATATACCcCCTGATCCcGACTTTcTaGATATT 

i ii i ii I i i i i n i i linn i ii in i mm 

18 5013 AgcCTgTggACacTaCattaacattTgatcCtCgTAgTgatGttCCTCATtCaGAtTTTaTgGATATT 
:on a-acttTacAattTae-eataattaTaat-ctcttaataatGctcctGATec-GacTTTaTgGAtATt 

OS— TAAGTTTACAATTTACCCATGAGTCTA TCCACAATGCACCTGATGAACCATTTATGGATATT-05 
022-TTAGTGTACAATTTAGTCATGATTCTA TACACAATGCACCTCATGACGCTTTTATGGACATA-022 
02 7-ACACATTACAATTTCAACATAGTCATA TATCACCTGCTCCTGATCCTGACTTTCTAGATATT-02 7 

6 5260 ATTcGttTgCAcAGACCtGCcATtgCGTCCcGACG tGGcCTTGTGCGgTacAGTCGCATTGGaCAACG 

e , e m i i ii nm ii n mn nn n mnm i mi mn 

11 5251 ATTaGAcTACATAGACCaGCTATaACGTCCaGACGgGGTCTTGTGCGTTTTAGTCGCATTGGgCAACG 

„ in i mini n nm n n i i i imimmii i mini 

33 5062 ATTGCATTACATAGGCCtGCTATtACaTCTcCtaGacaTacTGTGCGTTTTAGTACAgTaGCTCAAAA 

. n nn iiiiimm ii i ii in i i i in nun 1 m i n 

16 509 3 gTTGCtTTACATAGGCCaGCatTaACCTCTaGgcGtAcTggcaTTAGgTAcAGTAGAaTtGGTAATAA 

„ 5nna i n imiiimi ii i mn i i i mi n mm i iiiiim 

3 1 5009 ATaGCaTTACATAGGCCTGCccTtACCTCacGtaGgAacACTGTTAGaTATAGTAGAcTAGGTAATAA 

,„ ii iiiimiiim i ii ii i i mm i i mini nm i 

18 5081 ATccgtcTACATAGGCCTGCttTaACaTCcaGgcGtgggACTGTTcGcTtTAGTAGAtTACGTcAacg 

:on aTtg — tTaCAtAGgCCtGCtaT-aC-TCc-G-cGtggtactgT-cG-T-tAGTaGaaT-GGtcAa — 

JJ24-TACATAGGCCTGCTATAACNTCCAGNCGTGGTNNTGTGCGNTTTAGTAGA-JJ24 
05-ATTAGACTACATAGACCAGCTATAACGTCCAGACGGGGTCTTCTGCGTTTTAGTCGCATTGGGCAACG-05 
016-ACTGTGCGTTTTAGTAGAGTAGCTCAAAA-016 
022-ATTCGTTTGCACACACCTGCCATTGCGTCCCGACGTGGCCTTGTGCGGTACAGTCCCATTGGACAACG-02 2 
027-ATTCCATTACATAGGCCTCCTATTACATCTCCTACACATACTGTCCGTTTTAGTAGAGTAGGTCAAAA-02 7 
028-GTAGACATACTGTGCGTTTTAGTAGAGTACGTCAAAA 
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6 5328 GGGGTCtATGcACACtCGCAGcGGAaAgCAeATACGgGCCCGCATtCATTATT 
llflll III IIII Hill III I II lllll IIIIIIII lllllll 
11 5319 CGGGTCcATGfcACACaCGCACTGGAcAACAtATAGGfcGCCCGCATACATTATT 
I I I I II IIIIIIII IIII II II II I IIIIIIIIII 
33 5130 AGccACACTtaAaACTCGCAGTGG tAAACAaATtGGaGCTAGaATACATTATTATcAgGATTTAAGTc 

I inn inn iiiii iii ii ii mi milium i iimiin 

16 51S1 ACAAACACTaCGtACTCGTAGTGGaAAAtCTATaGGTGCTAaGGTACATTATTArrATGATTTAAGTA 

nun i ii minimi mi inn i in nimiimmi i mi 

31 5077 ACAAACTtTGCGcACTCGTAGTGGTqCtaCTATtGGTGCaAGGGTqCATTATTATrATGATATt AGTA 

IIII II II II II III I II lllll lllll II I IIII lllllll III 

18 5149 ggcAACTaTGtttACcCCcAGcGGTaCacaaATaGGTGCtAGGGTtCAcTtTTATcATGATATaAGTc 

con -g-aaCtaTgcacACtCGcAGtCC-aaacatATaCGtGCtagg-TaCAtTaTTatcatgatataagta 
05-GGGGTCCATGTACACACGCACTCGACAACATATAGGTGCCCGCATACATTATT ( -05 ) 
016-AGCCACACTTAAAACTCGCAGTGGTAAACAAATTGGAGCTAGAATACATTATTATCAGGATTTAAGTC— 016 
022-GGGGTCTATGCACACTCGCACCGGAAAGCACATAGGGGCCCGCATTCATTATT(-022) 

027- AGCCACACTTAAAACTCGCAGTGGTAAACAAATTGGAGCIAGAATACATTATTATCAGGAITTAAGTC-O2 7 

028- ACCCACACTTAAAACTCGCACTGGTAAACAAATTGGAGCTAGAATACATTATTATCAGGATTTAAGTC-02 8 

6 5381 TTfcAtGAtATTTCACCtaTTgCACAgGCTGCAGAaGAAATAGAAaTGCACCCTCTtGTGG 

ii i n iiiiiiii ii mi iiiiiiii iiiiiiiii iiiiiiiiii mi 

11 5372 TTcAgGAcATTTCACCagTTaCACAAGCTGCAGAgGAAATAGAAcTGCACCCTCTAGTGG 

I I III I I II II HI I II 

33 5198 CTATTG TgeCtttAGAcCACaccgTgCcAAAXgaACAAtaSgAATtAeAgcCTttaCaTgAtacT 

mm i iii ii ii i i i i i i ii i i 

16 522 9 CTATTCATCCTGCAGaAGAAAtagaatTACaAAcTatAacAccTtCtaCAtAt ACTACcACTTCacaT 
IIII IIIIIIIII IIII I III lllll lllll I 

31 5145 gTATTaATCCTGCAGgtGAAAgTATTGAAaTGCAaCCTTTAGgggCgTCTGCaACTACtACTTCtacT 

iiii m i iii mini iiii mini lllll I 

18 5217 cTATTgcaCCTtCcccaCAAtaTATTGAAcTGCAgCCTTTAG taTCTCC caCggag 
eon ctattgatc-t-c--- agaacacat — taca-aagct-caa — g-aatc— -aa-ctaccc — teg — - 

<O5-)TTCAGGACATTTCACCAGTTACACAAGCTGCAG-0S 
016-CTATTG TGCCTTTAGACCACACCGTCCCAAATGAACAATATGAATTACAGCCTTTACATGATACT-Ol 6 
02 2- TTTATGATATTTCACCTATTGCACAGGCTGCAC-022 

027- CTATTG TGCCTTTAGACCACACCCTCCCAAATGAACAATATGAATTACAGCCTTTACATGATACT-02 7 

028- CTATTG TGCCTTTAGACCACACCGTGCCAAATGAACAATATGAATTACAGCCTTTACATGATACT-02 8 

6 54 41 CTGCAcAggATGAtACaTTTGATATTTATGCTGAAtCtTTTGAaCCTggCatTaACCCTacCCAACAc 

iiiii i iiii ii 1 1 1 1 1 1 1 1 m 1 1 1 m 1 1 f i iiiii iii i i inn mm 

11 54 32 CTCCAgAaaATGAcACGTTTGATATTTATGCTCAAcCATTTGAcCCTatCccTgACCCTgtCCAACAT 
I I II I II I III 

33 52 63 tCtaCaTCgtCTtaTaGTATTAATgATGG tTTgTATGATgTTTATCC TgaCgAtGT 

i i ii ii i iiiiiii iiii ti mm mini 1 i 

16 5297 gCAgCcTCacCTacTTcTATTAATaATGGA TTaTATGATaTTTATGCaGATgacttTattACAGA 
I I I I I III I IIII II II III I III I 

31 5213 ttAAATGAtggCTTaTaTGAcATTTATGCAGA CAcTCATtTTaCTGTGGATacACCTgCcACAcA 

inn m i m i! iiiiiiii ii ii ii m i iii i 

18 5273 gacAATGA CTTgTtTCAtATaTATGCAGAtgaCAtgCAcccTgCaGTCccTgtACCatCgcgttc 

con -t-aa — a-a tat-T-ttAt-taTg-acag-ac-atgatatt — tgc taccctt-ccaa 

016 -TCTAC ATCGTCTTATAGTATTAATGATGG TTTGTATCATGTTTATGC TGACCATGT-016 
0 2 7 — T C TAC AT C GT C TT AT AGT ATT AATG ATG G TTTGTATCATGTTTATGC TCACGATGT-02 7 

028 -TCTACATCGTCTTATAGTATTAATG ATGC TTTGTATCATGTTTATGC TGACGATGT— 028 
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6 5509 cCTGTTACAaatatatcagAtaCaTATtTaACtTCCACACCTAATACagTTaCACAAcCGTGGGGTAA 

mum i i in i it iiiimmiiii n mn imimii 

11 5500 tCTGTTACA cAgtCtTATCTtACcTCCACACCTAATACCcTTtCACAAtCGTGGCGTAA 

I I 11 I I I I I I I II 

33 5319 ggaTaaTgtAcaC^CcecAAtgCaacaCTCATacAgtaCgTTtgCAaCaacaCgTACcaGcAATCTgt 

I I I I I I I I I I II II II III 

16 5362 TAcTtcTaCAaCCcCggtAcCatctgtacCCTCtACatCTTTaTCAGgtTATaTTCCTgCAAATACaA 

II I I I I I I II I I II Ml I INI III I I 

31 527 8 TAaTgtrTCCeCTtCtacTgCTgtAcAgTCCaCatCTgCTgTgTCTGCCTATgTaCCTACAAATACcA 
II I III I I I I I I II I II lllllll 1 II III I 

18 5338 TAcTaccTCCttTgCattTttTaaAtAtTCgcCcaCTatatctTCTGCCTcTtcctaTAgtAATgtaA 



con 



ta-tttt-ca tctc — attca teacctac — c-ttatcagcct-tc-ca-tagtaatgtaa 

016-GGATAATGTACACACCCCAATGCAACACTCATACAGTACGTTTGCAACAACACGTACCAGCAATGTGT-016 

027- GGATAATGTACACACCCCAATGCAACACTCATACAGTACGTTTGCAACAACACCTACCACCAATGTGT-02 7 

028- G5ATAATGTACACACCCCAATGCAACACTCATACAGTACGrrTGCAACAACACGTACCAGCAATGTGT-02 8 



6 5577 CACCACAGTtCCATTGTCAcTtCCTAaTGACC tGTTTtTaCAaTCTGGcCCTGAtATAACTTTTCCTA 

iiiiini iiimm i mi mi mi i n inn inn nmmmn 

11 5559 tACCACAGTcCCATTGTCAATeCCTAgTGACTGGTTTGTgCAGTCTGGgCCTGACATAACTTTTCCTA 

II I II II lllllll II I I M I III I 

33 5387 CtATACCTTTaaATacaGgATttgATAcTCCTGTTaTGTCtCGcCCTGATATACCtTccCcTTTAtTT 

I ii urn i i ii in mi i i ii n uiinniii i n i 

16 5410 CaATtCCTTTtgCTggtCcATacaATATTCCTtTAgTaTCaCCtCCTGATATACCcATtaATaTAaCT 

i i u i ii ii i mn ii i ii n mm mi u i i 

31 S346 CtGTgCCacTAAGTaCaGgTTttGAcATTCCcaTATTtTCtGGgCCTGATgTACCtATagAgCATgCA 
I II II III I II II I II III I II llllll I I III I 

18 5406 CgCTcCCttTAAcctCctcTTggCAtgTgCCtgTATacaCgGGtCCTGAT AttacattAcCATctA 

:on ca-taCctttaaatt-tg-aTtcgatat-cCtgt-tt-tc-ggtCctGat-taccataacottt-cta 

O27-CTATACC-rfrAAATACAGGATTTGATACTCC-rCTTATGTCTGGCCCTGA-rATACCTTCCCCTTTATTT-02 7 
02 8-CTATACCTTTAAATACAGGATTTCATACTCCTGTTATGTCTGGCCCTGATATACCTTCCCCTTTATTT-02 8 



6 5645 

11 5627 

33 5455 

16 5498 

31 5414 



CTGCAcCTATGGGAACACCCTTTAGTCCTGTAACTCCTGCTTTACCTACAGGCCCTGTTTTcATTACA 

Mill 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 II I ! f H I! 1 1 i 1 1 II 1 1 1 M M 1 1 1 1 II 1 1 1 M llllll 

CTGCAtCTATGGGAACACCCTTTAGTCCTGTAACTCCTGCTTTACCTACAGGCCCTGTTTTTATTACA 



I I I 

CcCacAtCTaGccCATTtgT 

i i ii mi i 

gaCcaAgCTccTTCATTAaT 
I II I I 

CctaCACaCgtTTtCCCATT 

I I I I I mm 

CtacCtCtGtaTggCCCATTg 



mi i i in in in mi ii u i 

TCCTATttCgCCTttTTTTCCTtttGACACcATTgTTGTaGAc 

mm u i m i n m in n 

TCCTATaGttCCagggTCTCCACAAtAtACaATTaTTGcTGAT 

mi i i ii i mm i in m mi 

TCCTtTGGCCCCTaCaaCgCCACAAgtGTCTATTtTTGTTGAT 

ii imin i i im ii mi i i ii 

cGGCCCCTgCctCtaCACA GTaTATTggTaTacAT 



C-caag-c-ctaTTttt-ttgat 



ct-c-act-tgtg-ac-a-ttttagtCCtatagctCCtgctt-t 
3E2 2-CCCAXXGXAXCACCCACCCCCC-BE22 
BE23-CCCAXXGXAXCACCCACCGCCCCXGCCXCXACACA-BE23 
016 -CCCACATCTAGCCCATTTGT TCCTATTTCGCCTTTTTTTCCTTTTGACACCATTGTTGTACAC -0 i 6 

027 -CCCACATCTAGCCCATTTGT TCCTATTTCCCCTTTTTTTCCTTTTGACACCATTCTTGTAGAC-02 7 

02 8 -CCCACATCTAGCCCATTTGT TCCTATTTCGCCTTTTTTTCCTTTTGACACCATTGTTGTAGAC-028 



SO 



55 



EP 0 477 972 B1 



6 5713 GCTTCTCgaTTtTATTTCCATCCTgCATCGTAtTTTGCACGtAaACGCCGTAAACGTATTCCCTTATT 

mini ii iimmnii mini iiimn i iiimimimiiimim 

11 5695 GGTTCTGACTTcrATTTGCATCCTACATCGTAcTTTGCACGcACACGCCGTAAACGTATTCCCTTATT 

m mini m inmi i i n m i i iiimm mi i n 

33 5518 GGTGCTGACTTTgtTTTACATCCTAGTTATTttATTrTACGtcGcaGgCGTAAACGTTTTCCATATTT 

i i mini i m 1 1 1 1 u 1 1 1 1 til i ti mn i minimi imini 

16 55 61 GcaGGTGACTTTTATTTACATCCTAGTTATTAcATGTTACGAaaaCGACGTAAACGTTTACCATATTT 

i mn mimi ii iiiiinnii nun i iminmn n mini 

31 547 7 GGgGGTGATTTTTATTTGCAcCCTAGTTATTATATGTTAaaAcgtCGACGTAAACGTGTAtCATATTT 

n m inmi n imii i i minium i inn 

18 553 7 GCtacacATTaTTATTTGtggCCattaTATTATtTtaTtcctaagaaACGTAAACGTGTtcCcTArrT 

con GgtgctgacTtttaTTTgcatCCtag-TatTat-Ttttacgta-acgaCGTAAACGT-TtcC-TatTT 

J J2 5 -CGTAAACGTNTTCCCTATTT 
PCR2 -CGTTTTCCATATTT 
027-GGTGCTGACXTTCGTTTTACATCCTAGTTATTTTATTTTACGTCGCAGGCGTAAACGTTTTCCATATTT-027 



5 5781 TTTTtCAGATGT GGCGGCCTAGCGACAGCACACTATATGTGCCTCCTCCtAACCCTGTATCC 

mi inmi innmnmiiiiimmiimnmni mnimni 

l 5763 TTTTACAGATCT CGCGGCCTACCCACAGCACAGTATATGTCCCTCCTCCcAACCCTCTATCC 

miinmn nmmn n mm n mum imini 

i 558 6 TTTTACAGATGTCcgTgTGGCGGCCTAGTGAGGCCACAGTgTACcTGCCTCCT GTaCCTGTATCT 

mi 1 1 1 1 1 1 1 1 i mi minimum n m mimi n n mm 

J 5629 TTTTtCAGATGTCTCTtTGGCtGCCTAGTGAGCCCACTGTCTACTTGCCTCCT GTCCCAGTATCT 

mi minimi mi mm inn minimi n in imini m 

t 554 5 TTTTaCAGATGTCTCTGTGGCGGCCTACcCAGGCTACTGTCTACTTaCCACCT GTCCCAGTGTCT 

mi mm n mimmiii n in n n i mm i m i 

3 5605 TTTTgCAGATGgCTtTGTGGCGGCCTAGtCAcaaTACcGTaTAtcTtCCACCT ccttCCGTGgCa 

, TTTTaCAGATCfcetctgtGGCgGCCTAG-GA — ccACaGTaTA — TgCCtCCTcc-gtecCtGTatCt 

TTTTNCAGATGTCTNTGTGGCGGCCTAGTGA-JJ25 
PCR1-CAGATGTCTCTGTGGCGGCCTAGTG-PCR1 
TTTTGCAGATG-PCR2 

027-TTTTACAGATGTCCGTGTGGCGGCCTAGTGAGGCCACAGTGTACCTGCCTCCT GTACCTGTATCT-02 7 



6 584 3 AAaGTTGTTGCCACGGATGCtTATGTTActCGCACCAACATATTTTATCATGCCAGCAGTTCTAGACT 

II mmiiimiiin It It I t I t I t I I I I t 1 I I I t I 1 t I I I II I ! II I I M > I f I ! f It I 
11 5825 AAgGTTGTTGCCACGGATGCgTATGTTAaaCGCACCAACATATTTTATCATGCCAGCAGTTCTAGACT 

ii mn in mi mn mn i m i mi 1111 i mn mn 

3 3 5 651 AAaGTTGTcAGCACtGATGAATATGTgtCtCGCACAAgCATtTATTATtATGCtGGtAGTTCCAGACT 

ii mn mn mi nun i mini in mm mi n i immi 

16 5 694 AAgGTTGTAAGCACGGATGAATATGTtgCACGCACAAACATATATTATCATGCAGGaAcaTCCAGACT 

ii immiiiiiiiiiiiiiiii mi n imiimnm inn i inn 

31 5 610 AAAGTTGTAAGCACGGATGAATATGTaACACGttACcAACATATATTATCAcGCAGGCACtgCTAGgCT 

i mum ii mn nm n i n i mn mm n mn mi i 

18 5 670 AgAGTTGTAAatACcGATGAtTATGTgACtCccACaAqCATATtTTATCAtGCtGGCACctCTAGatT 

con AaaGTTGTaagcACgGATGaaTATGTtac-CgcAC-AaCATaT-TTATcAtGC-gGcAgttCtAGacT 
JJ2 6-GTTGTNANCACGGATGANTATGTTACTCGCACAA- J J2 6 
PCR3-AAGTTGTAAGCACCGATGAATATGT-PCR3 
LCaiA-AAGTTCTAAGCACCCATCAATATGT-LCRlA 

LCR 1 3 -TGC ACGCACAAAC ATATATTATCA- LCRB 
LCR1B ' -ACGTGCGrGTrTGTArATAATAGTA-LCRB ' 
027-AAAGTTGTCAGCACTGATGAATATGTGTCTCCCACAAGCATTTATTATTATGCTGGTAGTTCCAGACT-027 
6 5911 t CTTG CaGTGGG ACATCC tTATTttTCcATaAAA cggGcTAA C AAAA CtGTTGTgC 
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inn minimi im n n in i m 

S893 cCTTCCTGTCGGACATCCATATTacTCrATcAAAAA AGtTAA 

mum n iiiiinm mn urn m 

5719 tCTTGCTGTTGGcC ATC CATATTTTTCT ATT AAAAA t C CTA.C TAA 

mn iiiiuiiii mm minim mi if 

5762 aCTTCCAGTTGGaCATCCcTATTTTcCTATTAAAAAACCTAACAAT 

m mi ii mn mi i i n m n urn n mi 

56 78 gCTrACAGraGCcCATCCATATTaTtCcATacctAAAtCTGACAArecTAAaAAAA 
i II 11 Li HI 111 111 I III I [ II I 



5738 atTaACtGTtGGtaATCCATATT 



i im i mn i 

C AAAA CAGTTGTAC 

I llll I I Mil 

CgetAAAAAAtTATTgGTAC 

mi mi it i 

AAcAAAA TATTaGTtC 
M I II I 
TAGTTGTaC 
llll 



TtagggttcCTGcaggTggTggcAAtAagcagGaTaTtC 



-cTtgC-GTtGGacATCCaTATTtttctaTtaaaaaacctgctaat — caacaaaAaa-tagttgTaC 
JJ27-GTTGGACATCCATATTTT-JJ27 
027-TCTTGCTGTTGGCCATCCATATTTTTCTATTAAAAATCCTACTAA CGCTAAAAAATTATTGGTAC-027 



6 5967 CAAAGGTGTCaGGATATCAATAcAGgGTaTTTAAGGTgGTGTTaCCAGATCCTAACAAaTTTGCATTg 

iiiiuiiii immiiM it n iiiiiiii mn miiimimi iinnii 

11 5949 CAAAGGTGTCtGGATATCAATATAGaGTgTTTAAGGTaGTGTTgCCAGATCCTAACAAgTTTGCATTa 

i ii ii ii ii i iimui II llll III II I I I 11 I I I I M II llll III 

3 3 5784 CcAAAGTATCAGGcTTgCAATATAGCCTtTTTACGGTcCgTTTACCAGATCCTAATAAaTTTCGATTT 

i mmmu n inn mn mn i i mm m n nm nm m 

16 582 4 CtAAAGTATCAGGATTACAATAcAGGGTATTTAGaaTaCaTTTACCtGAcCCcAATAAgTTTGGtTTT 

i ii ii mumiiuii i m i ii ii i n i i mm n n n n um in 

31 574 3 CaAAGGTgTCAGGATTACAATATAGGGTATTTAGCGTtCgTTTACCaGAtCCAAAcAAATTTCGaTTT 

i um ii i ii iimui uiunim i um n mn mum n 

18 5800 CtAAGGTtTCtGcATacCAATATAGaGTATTTAGGGTgCagTTACCtGAcCCAAAtAAATTTGCtTTa 

con CaAAgGTgTCaGgaT — CAATAtAGgGTaTTTAgggT-c — TTaCCaGAtCCtAA-AAarTTGgaTTt 

JJ28-CAATATAGGGTATTTAGGGTNCNGTTACC-2 8 30-AATAAATTTGGATTN 
PCR4-CrTArArCCCJirAAATCCCArGTTAA-PCR4PCR5-rrATTTAAACC:AAAA 
027-CCAAAGTATCAGGCTTGCAATATAGGGTTTTTAGGGTCCGTTTACCAGATCCTAATAAATTTGGATTT-027 



6 60 3 5 CCTGAcTCgTCtCTfl-TcGAtCCCACaACACAaCGTTTAGTAIGGGCaTGCACAGGccTaGAGGTgGG 

mn ii u n ii ii um um imnimim mitni i um n 

11 60 17 CCTGAtTCaTCCCTgTTTGACCCCACTACACAgCGTTTAGTATGGGCgTGCACAGGgtTgGAGGTAGG 

imi i um i i uu nun ii minimi u m i n mi 

33 5852 CCTGACACCTCCTTTTATAACCCtGATACACAaCGa-ITAGTATGGGCaTGTGTAGGccTTGAaaTAGG 

minimi mum u mum u i u imi mum uu mi 

16 5892 CCTGACACCTCaTTTTATAATCCaGATACACAgCGgcTgGTTTGGGCCTGTGTAGGTgTTGAGGTAGG 

mil ii u milium n n u n i imimmiu m i iiiiiiii 

31 5811 CCTGATACaTCTTTTTATAATCCTGAAACtCAACGcTTAGTTTGGGCCTGTGTTGGTtTaGAGGTAGG 

mum i 1 1 1 1 1 ( 1 1 1 1 1 ii i ( i mn um iiiiuiiii m in i u 

18 5 868 CCTGATACtagTaTTTATAATCCTGAAACaCAACGtTTAGTgTGCGCCTGTGcTGGagTgGAaaTtGG 

con CCTGA-aC-tcttTtTataAtCCtga-ACaCAaCGttTaGTaTGGCCcTGtg-aGG— T-GAggTaGG 

CCTG ACACCTCNNTTTATAAT- JJ 3 0 
GGACTGrCGA-PCRS 

OZ7-CCTGACACCTCCTTTTATAACCCTGATACACAACCArrAGTATGGGCATGTGTACGCCTTGAAATAGG-027 
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6 6103 CAGGCGaCAgCCaTTAGGtGTgCGTGTaAGTGCaCATCCtTTcCTAAAtAAATATGATGATGTtCAAA 

huh ii ii urn ii nut inn mn n him mimiiiiiii mi 

11 6085 CAGGGGtCAaCCtTTAGGCGTTGGTCT^GTGGgCATCCaTTgCTAAACAAATA'TGATCATQTaGAAA 

ii ii ii ii minimi i mn mn n imiim nun im 

33 5920 TAGaGGgCAGCCATTAGGCGTTGGCATaAGTGGtCATCCTTTATTAAACAAArTTGATGACACtGAAA 

i i ii minimi ii urn mn immimm nm mum mi 

16 5960 TCCtCGtCAGCCATTAGGTGTgGCCATTAGTGGcCATCCMTATTAAATAAATTgCATCACACaGAAA 

iii ii iii mi iii in ii mum nm iiimmmii mimi mi 

31 5879 TCGCCGgCAGCCATTAGGTGTaGGtATTAGTGGtCATCCATTATTAAATAAATTtGATCACACTGAAA 

ii ii nm mum u mini mum i mum mmimui 

18 59 3 6 cCGtGGtCAGCCtTTAGGTGTtGGccTTAGTGGgCATCCATTtTatAATAAATTaGATGACACTGAAA 



6 6171 AT tCaGGGagTCGTGGTAAcCCTGGaCAGGATAAcAGGGTTAATGTAGGTATCGATTATAAACAA 

ii in iiuuiii tun mum mmmimmmumnun 

11 615 3 ATaGTggtGGGTATGGTGGTAAtCCTGGTCAGGATAATAGGGTTAATGTAGGTATCGATTATAAACAA 
II llll III I II III Ullllllll I 11 I i I 1 I H 1 I I I I I I I 

33 59 88 ceGGTAacaaGTATcCTGGAeAACCgGGTGctGATAATAGGGAATGTtTATCcATGGATTATAAACAA 

I ii m i i i i I mi mum umi mi muni mm 

16 60 28 AtGCTAgTgctTATGCaGcAaATgCaCGTGtgGATAATAGaGAATCTATATCtArCGATTAcAAACAA 

i m i urn i i i ii mimi milium iiiinn mm 

31 59 47 AeTCTAATagaTATGCcGgTggTcCTGGcactGATAATAGGGAATGTATATCaATGGATTATAAACAA 

ii ii iiii n mm I i u i mum n 

18 6004 gtTCccATgccgccaCgtcTaaTgtTtctgagGAcgtTAGGGAcaaTgTgTCtgTaCATTATAAgCAg 



6 6 236 ACaCAatTATGcATGGTtGGaTGTGCcCCcCCtTTgCGcCAgCATTCGGGTAAaGGTAaACAgTGTaC 

ii ii mi mn ii mn ii ii ii n n iiuuiun mi in m i 

11 6221 ACcCAGcTATGTATGGTgGGcTGTGCtCCaCCgTTAGGtGAACATTGGGGTAAgGGTAcACAaTGTtC 

ii in mn i i ii m u ii mi uiuunmn m in i 

33 605 6 ACACAGTTATGTTTAcTTGGaTGTAAgCC tCCa AcAGGGGAACATTGGGGTAAAGGTg ttgCtTGTAC 

mn ii iiim im u ii ii u i iiuuiii mn urn i mn 

16 6 096 ACACAATTGTGTTTAaTTGGTTCCAAACCACCTATAGGGGAACAcTGGGCcAAAGGatccCCaTCTAC 

1 1 1 1 1 1 mum 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 ii u n mn nm u mi 

31 6015 ACAC AAcTGTGTTTAcTTGGTTCCAAACCACCTATTCGaGAgCA tTGGGGTAAAGGtAgTCCTTGTAg 

nm i in i i u ii ii mini u u uu mm i i nun 

18 6 072 ACACAgtTaTGTaTttTgGGcTC tgccCC t gCTATTGGgC AaCAcTGGGcTAAAGCcAcTgCTTGTAa 
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6 6304 tAATACacCTCTACAggcTCGTGACTGCCCgCCcTTaGAACTTATTACCACTGrrATACAGGATGGca 

iiiii mini iiiimmii ii u iiiiiiiiiiiiiiiiiiniiiiiiiii i 

1 $299 «AATACCtCTGTACA*a»TGGTGACTGCCCcCCgTTgGAACrrATTACCAGTGITATACAGGATGGaG 

it I i ii ill ii ii ii ii iiiiiiii i i i mi mini i 

3 6124 TAarCCAGcACcTgCcaaTCATTGTCCACCtTrAGAACTTATAAAtACTaTTATTgAQGATGGTG 

i iiiii i i i minnim in i nm n inn imimi 

5 SI 64 cAATgtTGCAGTAAaTCCaGGTGATrGTCCACCATTAGAgTTAATAAAcACAGTTATTCAGGATGGTG 

ii ill i i n minimi iiiiiiii ihi in mini n nm i 

1 6083 tAAcaaTCCTaTtACcCCtGGTGATTGTCCtCCATTAGAATTAAAAAAttCAGTTATaCAAGATGGgG 

mm m ii iiiii ii ii mm i nm nm i mini i 

6 614 0 at cgcgTcCTtTatCaCagGGcGATTGcCCcCCtTTAGAAcTtAAAAAcaCAGTTtTggAAGATGGtG 

-aata-tgCtgtae— cctggtGAtTG-CC-CC-TTaCAacTtAtaAaeac-gTTaTacAgGATGGtG 

JJ36-GATGGTG 

27- TAATGCAGCACCTGCCAATGATTGTCCACCrTTAGAACrrATAAATACTATTATTGAGGATGGTG-02 7 



6 6372 AtATGGTTGAcACAGGCTTTGGTGCTATGAATTrTGCtGAtTTgCAgACCAATAAATCaGATGTTCCt 

i iiiiiiii iimimimnimmim n it n iinnnni mum 

1 6357 AcATCGTTGAtACAGGCTTTGGTGCTATCAATTTTGCAGAcTTaCAaACCAATAAATCgGATGTTCCC 

i iiiii n inn mm in inn i imi i mm iiiiiiii 

3 6189 ATATGGTgGAcACACGaTTTCGTtgcATGGATrTTAaAACATTgCAGGCTAATAAAAGTGATGTTCCt 

imm i 1 1 ii mm nm nn iiiii iiiiiiii iiiiiiii inn 

6 6232 ATATGGTTCATACtGGCrPTGGTGCTATGGAcTrrACTACATTACAGGCTAAcAAAAGTGAaGTTCCa 

iiiiiiii mi iiiiiiii iiinni iimi i inn i i mm i nm 

1 6151 ATATCGTTCATACaGGCrTTGCaCCTATCCAtTrTACTgCtTTACAAGAcACTAAAAGTaAtGTTCCt 

imm iiiii ii i m ii nm im i i ii iiiii nun n i n n 

E 6208 ATATGGTaG ATACtGGaT aTGGtGCcATCG AcTTTAgTaCaTTgC AAGAtACTAAAtGTgAgGTaCC a 

n AtATGGTtgAtACaGGcTtTGGtgctATGgAtTTTact-caTT-CA-gc-AatAAA-gtgAtGTtCCt 
ATATGGTTGATACACGCTTTGCTGCTATCGA-36 37-CATTNCANGCNAATAAANGTGATCrrCCT 
027-ATATGGTGGACACAGGATTTGGTTGCATGGATTTTAAAACATTGCAGGCTAATAAAAGTGATGTTCCT-02 7 



6 6440 aTTGACATaTGTGGCACTac&TGtAAATATCCaGATTATTTaCAAATGGCTCCACACCCaTATGGTGA 

mi ii nm m n mum iiiiiiii miiimmimi iiiiiiii 

1 1 64 2 5 CTTG ATATTTGTCGaJVCTgtcTGCAAATATCC tCAITATTTgCAAATGGCTGCAGACCCTTATGGTGA 

minmnn i i iimmm iiiiiiii nm n n mnnim 

33 6257 aTTGATATTXGTGGcAgTAcaTGCAAATATCCAGATTATTTaAAAATGaCTagtGAgCCTTATGGTGA 

i mmm n iimimimiim i mm n n nm n 

16 6300 cTGGATATTTGTAcaTCTATTTGCAAArATCCAGATTATaTT AAAATGGTgtCaGAaCCATATCGCGA 

mi mini iiiiiiii mmmmiii imnim i n mnmin 

31 62 19 TTGGAcATTTGTAAtTCTATTTGTAAATATCCAGATTATcTTAAAATGGTTGCtGAgCCATATCGCGA 

nm mm i mimmimm mm i nm m n n nm n 

18 6276 TTGGAtATTTGTcAgTCTATTTGrAAATATCCtGATTATtTacAAATGtcTCCaGAtCCtTATGCgGA 

con -T-GAtATtTGTgg— cTattTG-AAATATCCaGATTATtTa-AAATGgctgcaGA-CC-TArGGtGA 

NTNGATATTTGT- J J37 J J 3 8 - AAATATCCAG ATTATTTANAAATGG- J J 3 8 
02 7-ATrGATATTTGTG5CAGTACATGCAAATATCCAGATTATTTAAAAATGAC7AGTGAGCCTTATGGTGA— 02 7 
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6 6508 TACaTTaTTTTTTTtTcTaCCgAACCAACAAATGTTTCCcAGACAtTTTTTTAAcAGCCCtCCcgagG 

iii ii iiiiiii i i u miimmmui inn mum um u i 

11 6493 TAGgTTgT'rTT M TTaTfcTgCGAAAGGAACAAATGTTTGCtAGACAC n I rTT AATACGGCcGGTACtG 

in ii um i i i iii miiimim iiiiiiiiiiiiiiiiiin um 

33 6325 TAGtTTATTTTTcTtTcTtCGAcGtGAACAAATGTTTGTaAGACACTTTTTTAATAGCGC'rGGTACat 

ii mum i i i in i imuiMiiiii um u miiimmiuu 

16 6368 cAGcTTATTTTTTTATTTACGAAGCGAACAAATGTrrCTtAGACATTTaTTTAATAGGGCTGGTACtG 

i immiiimiiii imuiimmiu ii um i mm i i n u i 

31 6287 TACaTTATTTTTTTAMTACGtACGGAACAAATGTTTGTaAGGCATTTTTTTAATAGAtCACGcACgG 

i i i iiiiiii um i ii u i mi iiiiiinii nun mi ii 

18 6344 TtCcaTgTTTTTTTgcTTACGgcGtGAgCAgcTtTTTGctAGGCATTTTTggAATACAgCAGGtACta 

con tag-tTaTTTTTtTattTaCCaaggGAaCAaargTTTG-UkGaCAtTTtTttAAtAGggCtGGtactg 

WO 86/05816-GAGG 

027 -TAGTTTATTTTTCTTTCTTCGACGTGAACAAATGTTTGTAAGACACTTTTTTAATAGGCCTGGTACAT-02 7 
6 6576 TCaGGGAACCTGTGCCTCATacaCTtaTaaTtAAgGGtaGTggaAAtcGcaCgTCTGTAGggAGTAGT 

1 1 1 M i 1 1 1 1 1 1 i 1 1 1 1 1 1 f II I I II II II II I I IIIIIII llllll 

11 6561 TGGGGGAACCTGTGCCTGATGACCTGTTggTaAAACCggGTaatAAeAGatCaTCTGTACctAGTAGT 
I II II llll II Mllllllll I lllll IIIIIII II III 

33 6393 TaGGaGAggCTCTtCCcGATGACCTCTACATTAAAGGtTCaGGaACTACTCCcTCTaTtcaaAGcAGT 

i ii ii iii ii ii ii i mimim ii ii mini i i u 

16 6436 TTGGTGAAaaTGTaCCaGAeGAtTTATACATTAAACGCTCtGCgTCTACTGCaAaTTTAGCcACttca 

mum ii ii ii um milium n n n n i mm i 

31 6355 TTGCTGAAtCgGTcCCTactGACTTATATATTAAAGGCTCcGGTTCaACaGCTACTTTAGCtAaCaGT 

i mu i n in mmmnmii i in i in i i i i u 

18 6412 TgGGTGAeaCtCTgCCTcaateCTTATATATTAAAGGCaCaGGTatgcCtGCTtCacctGgcAgCtGT 

con TgCGtGAa-ctGTgCCtgatgae-Tata-aTtAAaCGctetggtactactgC-tct-tagc-Ag-agt 

TGGGGGAACCTGTGCCTGATACACTTATAATTAAGGGTAGTGGAAATCGCACGTCTGTA-W08 6/05816 
LCR2A-ACCTGTTGGTAAAAGGGGGTAATAA-LCR2A 
LCR2A - -GGACAACCATTTTCCCCCATTATT--LCR2A ' 

LCR 2 B-CAGATCATCTGTACCTAGTAGT 
LCR23' -GTCTAGTACACATCGATCATCA 
LCR3A-ATTTATACATTAAAGGCTCTCCCTC-I.CR3A 
LCR3A ' -AAATATCTAATTTCCCAGACCCAG-LCK3A ' 

LCR3B-TACTGCAAATTTAGCCAGTTCA 
LCR3B ' — ATGACGTTTAAATCGGTCAACT 
LCR4A-CCTTATATATTAAAGGCACAGGTAT-LCR4A 
LCR4A ' -GAATATATAATTTCCGTGTCCATA—1*CR4A ' 

LCR4 B-GCCTGCTTCACCTGCCAGCTGT 
LCR4B' -CCGACCAAGTCCACCGTCCACA 
O27-TAGGAGAGGC7GTTCCCGATGACCTGTACATTAAAGGTTCAGGAACTACTGCCTCTATTCAAAGCAGT-02 7 
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« 6644 XTaTATGTtaAcACcCCgAGcGGCTCtTTGGTGTCeTCtGAGCCaCXATTgTTTAATAAgCCATXTTC 

n him i ii ii ii inn iiiinii ii inn inn imim mum 

11 6629 ATTTATGTaCAtAC4CCTAGTGGCTCXTTG<:TGTCTTC»GACGCTCAATTATTTAATAA»CCATATTe 

ii i i i ii iiiinii iii mi mi ii 1 1 n minimi mum 

33 6461 gcTTtTTTTCCcACtCCTAGTGGaTCAATGGTTACTTCcGAatCTCAgTTATTrAATAAgCCATATTC 

ii imii ii mum u mum u u i u mi urn u urn 

16 6504 AaTTATTTTCCTACACCTAGTGGtTCtATGGTTACeTCtGATGCcCAAATATTcAATAAACCtTATTG 

i ii miiuumu ii u mmu n urn inn n imim urn 

31 6423 AcaTAcTTTCCTACACCTAGcGCCTCcATGGTTACtTCaGXTGCaCAAATtTTTAATAAACCATArrG 

ii i iii i ii u inn ii urn 1 1 n in i unmiimum 

18 6480 gtgTAtTcTCCctCtCCaAGtCGCTCtATtGTTACcTCtGActecCAgtTgTTTAATAAACCATATTG 
con attTattttcc-aCaCCtAGtGGcTCtaTgGTtAC-TCtGA-gC-CAatTaTTtAATAAaCCaTATTG 
AT-LCR2B JJ39-GTTACNTCTCANCCNCAATTATTTAATAAACCATATTG 
TAA-LCR2B' 
AA-LCR3B 
JTA-LCR3B ' 
GT-LCR4B 
CAC-LCR4B ' 

02 7 -GCTTTTTrTCCCACTCCTAGTGGATCAATGGTTACTTCCCAATCTCAGTTATTTAATAAGCCAIATTG-02 7 
6 6712 GCTaCAAAAaCCcCAGGGACATAACAATCGTATTTG tTGGGGtAAtCAaeTGTTTGTTACTGTGGTAG 

ill iiiii u mmiuuiuummi inn n n niniimimnu 

11 6697 GCTtCAAAAgGCtCAGGGACATAACAATGGTATTTGcTGCGGaAAcCActTGTTTCTTACTGTGGTAG 

ill iii ii u ii iiiii minimi iiiii ii ii i mimimini 

33 6529 GCTACAACG tGC ACAaGG tC ATAATAATGGTATTTGTTGGGGc AAtCAggTAT TTGTTACTGTGGTAG 

i iiiiiii urn ii ii muni 1 1 1 1 1 1 1 1 1 1 1 ii ii miuiuiui u i 

16 6 S 7 2 G tTACAACGaGCACAGGGcC ACAATAATGGcATTTGTTGGGG t AACCAacTATTTGTTACTGTtGTtG 

I I IIIII II IIIII 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 M I I I I II II 1 1 1 1 1 i I ! I [ I i I II I 

31 6491 GaTgCAACG tGC tCAGGGaCACAATAATGGTATTTGTTGGGGcAATCAgTTATTTGTTACTGTGGTAG 

i I ii ii iiiii ii ii mm mi in iiiii uuunmuinm 

18 6548 GtTaCAtaagGCaCAGGGtCAtAAcAATGGTgTTTGcTGGcatAATCAaTTATTTCTTACTCTGCTAC 

con GcTaCAa GCaCAgGGaCAtAA-AATGGtaTTTGtTGGggtAAtCAatTaTTTGTTACTGTgGTaG 

CCTACAANNNGCACA-JJ3 9 J4 1 -AATGGTATTTGITGGGGTAATCAATTATTTGTTACTGTGGXAG 
C6-GCMCAGGGWCATAAYAATGG-C6 Cl-CTGTGGTAG 

C7-CTGTTGTTG 
CS-CTGTCGTAG 
CIO- CAGTTGTAG 
CU-CTGTGGTTG 
C 1 2 - CTGTTGTGG 
C13-CTGTTGTAG 
C14-CTGTGGTAG 
C15-CTGTAGTGG 

027-GCTACAACGTGCACAAGGTCATAATAATCCTATTTGTTGGGGCAATCAGGTATTTGTTACTGTCGTAG-027 
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TGCATCcCTaaCTAcATCTtCcACATACACc 
IIIIII II III III! ! llllilll 
TGCATCtGTgtCTAaATCTgCTACATACACt 

II II III IIIIII I 

caCAcaAGTaACTAgtGAcAgTACATATAAA 
III II I llllllllll 
GCcATAtCtACTtcaGAaACTACATATAAA 
II II II II lllllll Itll 

GCAATtgCaAacagtGATACTACATtTAAA 
II I I II I I 

iccTgggcaATaTgAt 



6 67 80 ATACCACACCCACTACcAAcATGACAtTATC 

llllllllllllllll II IIIIII llll 
11 6765 ATACCACACGCAGTACaAATATGACAcTATG 

lllllll llllilll llllilll llll 
33 6597 ATACCACtCGCAGTACtAATATCACtTTATG 

llll II llllilll IIIIII I Mill 
16 6640 ATACtACACGCAGTACaAATATGTCaTTATGTGCT 

iiii urn inn ilium i mm 

31 6559 ATACCACACGtACrACCAATATGTCtgTtTGTCCT 
lllllll I Illllllll I I I IIIIII 

IB 6616 ATACCACtCCCAGTACCAATtTaaCaaTaTGTGCTtctaCAeagtCfcectgtai 

con ATACcACaCgcAGTACcAAtaTgaCatTaTGtgct — tgCa g-»act»-ag-tactacATacaaa 

ATACC - J J4 1 C 1 6-CATCCGTAACTACATCTTCCA-C 1 6 

ATACCACACCCAGTAC -CI C 1 7 -TCTGTGTCTAAATCTGCTACA-C 1 7 

ATACTACACGCAGTAC-C7 C2 0 - C AC ACAAGT AACTAGTG AC AG-C 2 0 

ATACCACTCGCAGTAC-C 8 C 2 3 -CAGTCTCCTGT ACCTGGG-C 2 3 

ATACTACTCGCAGCAC -CIO C3 1 - TTG CAAACAGTG AT ACT AC ATT - C 3 1 

ATACTACCCGTAGTAC-C1 1 
ATACTACCAGAAGCAC-C 1 2 
AT ACT ACT AG AAGC AC -C 1 3 
ATACCACACGTAGTAC-C14 
ACACTACCCGCAGTAC-C 1 S 
02 7 -ATACCACTCGCAGTACTAATATGACTTTATG CACACAAGTAACTAGTGACAGTACATATAAA-02 7 



11 6827 
33 6659 



31 6624 
18 6684 



02 7-J 



AATTCtCATTATAAaGAgTACATGCGtXATGTCGAaGAGTaTGATTTACAaTTTATTTTTCAATTaTG 

urn niiiiii it niiiiii muni mi imnm mmiimni n 

AATTCAGATTATAAgGAATACATGCGcCATGTGGAgGAGTtTGATTTACAGTTTATTTTTCAATTgTG 
III I III III lllll II I lllll II II I llll llllilll llllilll I II 
AATgaAAATTTTAAaCAATAtATAaGACATGTtGAaCAATATGATcTACAGTTTgTTTTTCAACTaTG 

m ii urn ii ii ii mm 11 iiiiiiiii mum mmmi u 

AATAcTAAcTTTAAgGAGTAccTAeGACATGGgCAGGAATATGATTTACAGTTTATTTTTCAACTgTG 

i ii iii iiiii mn ii mini mini iiiiiiiii uiii iiui i ii 

AgTAgTAAtTTTAAaGAGTATtTAAGACATGGTGACGAATtTGATTTACAaTTTATaTTTCAGTTaTG 

ii ii iiiii iiiii mini iiniui nun u urn mum n 

gcTAccAAaTTTAAgcAGTATagcAGACATGtTGAGCAATaTGATTTgCAgTTTATtTTTCAGTTgTG 
aaTa c t aAtTtTAA- gAgTA- a 



6 6910 TAGCATTACATTgTCTGCtGAAGTaATGGCCTATATtCACACAATGAATCCcTCTGTTTTGGAaCACT 

minium mu um umuun umiuiimi niiinim mi 

11 689 5 TAGCATTACATTATCTGCAGAAGTcATCGCCTATATaCACACAATGAATCCtTCTGTTTTGGAgGACT 
I I I II III I II i II II II III I III II II I II I I II II I I I II II II I 
33 672 7 CAAAgTTACCTTAACTGCAGAAGTTATGACATATATtCATgCTATGAATCCagaTATTTTaGAaGAtT 

iiii i uummiiu umuun u m mum i mm u n i 

16 6773 CAAAATAACCTTAACTGCACACGTTATGACATAcATaCATtCTATGAATtCcaCTATTTTGCAgGAcT 
Illllllll III llllilll I llllilll II II lllllll I llllllllll II I 
31 6 69 2 CAAAATAACaTTAtCTCCAGACaTaATCACATATATTCAcAGTATGAATcCtgCTATTTTCGAaCATT 

i u u in mini i in i iiniui iiiiuni mm u nn 

18 6 7 52 tAct ATtACtTTAaCTGCAGAtgTtATGtCcTATATTCAt AGTATG AATagca gTATTTTaGAgGATT 

.-actATGAATccc-cTaTTTTgCA-GA-T 
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6 6 9 7 8 GGAACTTTGGgTTATCGCCTCCcCCAAATGCTACAtTaGAaGATACcTATAGgTATGTgCACTCACAG 

f 1 1 1 1 f 1 1 1 1 Milium! 1 1 1 1 1 1 1 f 1 1 1 1 i ii inn inn mil iiimni 

11 6963 GGAACTTTGGTTTATCGCCTCCACCAAATGCTACAcTgGAGGATAC tTATAGaTATGTaCAGTCACAG 

ii i iiimni i urn m ii ii i 1 1 1 1 m i inn i iii ii iii 

33 679 5 GCcAaTTTGGTTTAaCACCTCCtCCAteTGcTAgttTAeAGGATACcTATAGGTTTGTtaecTCtCAG 

ii i mm ii mm m i i hi mil iiimimi i m 

16 6841 GGAATTTTGGTcTAcaACCTCCcCCAggAGGcACacTAGAaGATACtTATAGGTTTGT aaCcCAG 
I IIIMIMI I I II II III I I II IMM IIIMIMI I I I III 

31 67 60 GGAATTTTGGatTgaCCaCaCCtCCctCAGGTtCTTTCGACGATACcTATACCTTTGTcaccTCaCAG 

mi mil i ii i ii ii i i urn mm m i urn ii 

18 6820 GGAAcTTTGGtgTtcCCcCcCCcCCaaCtacTagTTTGGtGGATACaTATcGtTXIGTacaaTCtgtt 
con GGaAcTTTGGttTa-c-cCtCCcCCaactggtac-tT-gagGATACcTATaGgTtTGTaca-tCacag 

C21-TTGT AACCCAG 
C34-TTTCT AACCCAG 
C35-QTTTGT AACCCAG 



6 704 6 GCCATTACCTGTCAaAAgCCCACtCCTGAAAAgGAAAAgCcaGA 

1 1 1 1 1 1 1 u 1 1 1 1 1 ii urn mmii mil i n 

11 7031 GCCATTACCTGTCAgAAACCCACACCTGAAAAaGAAAAaCAgGA 

II Mill Mill ill I I I I I III IIMI I II 
33 6863 GCtATTACgTGTCAAAAAaCagtACCTCCAAAgGAAAAgGAAGA 

II III I milllll lllim || Mill 

16 6906 GCaATTgCtTGTCAAAAAcaTaCACCTCCAgcaCCtAAaCAACATgatCCCCTTA AAAAATAc AC 

II III I MIIIIMI I III ! I II II MINI II III II I II 
31 6828 GCcATTACaTGTCAAAAAacTGCcCCcCaAaagCCcAAgGAAGAT ' 

II Mill Milllll III II II I III 

18 6888 GCtATTACcTGTCAAAAggaTGCtgCaCcggctgaaAAtaAgGAT 



TCCCTATAAG aAc cTt AGTT 

mm mi i i mi 

TCCCTATAAGgAtaTgAGTT 

Mil I I ' 

CCCCTTAggtAAATAtACaT 

mini inn ii ■ 



CCaTTTA AAgAtTAtgtaT 
II I I I I I 
CCcTaT gAtaAgTtaaagT 



GCcATTaCcTCrCAaAAacct-cacCtc-aoaggaaAAggaaGAt cCccttaaa-aA-t-to-tT 

GCAATTGCT-C21 C18-CATACACCTCCAGCACCTAA-C1 8 JJ4 6-T 

C19-GGATGCTGCACCGGCTGA-C19 
C 2 2 - AAAAACAGT ACCTCCAAAGGA-C2 2 
C 2 7 - TT-mCTCA TCCAGG TTTCCT-C21 

C2 4 -CACACCTGAAAAAGAAAAACAG-C 2 4 
C 2 8 - G rffTGGAC TTTT7CTT TTTOTC- C 2 8 
C25-CTCCTGAAAAGGAAAAGCCA-C2 5 
C 2 6 - GAGCACTTTTCCTTTTCCCr-C 2 6 

C29-CCAAAAGCCCAAGCAAGAT C-C29 
C30-CAAAAGCCCAAGGAAGAT C-C30 
C3 2-CAGAAACCCACACCTGAAAAAGA-C32 
C3 3 - AGAAACCC AC ACCTGAAAAAGAA-C 3 3 
GCAATTGCT-C34 023-GGA 
GCAATTGCT-C35 015-GGAT 
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; 7110 TTTGGGAGGTTAAtTTAAAAGAAAAGTTTTCtAGTGAATtgGATCAGTaTCCttTgGGACGcAAGTTT 

in inn inn mifiiiimiim iiiiiiii iniiii in i inn nun 

, 709S TTTGG<iAGGTTAAcTTAAAAGJLAAAGTTTTCAAGTGAATTAGATCAGTTTCCccTt.GGACGtAAGTTT 

iiiiiii ii i inn inn nun ii Minimi Mi) i mil mm 

I 6927 TTTGGGAAGTggATTTAAAGGAAAAaTTTTCAOCAGAtTTAGATCAGTTTeCTTTgGGACGCAAGTTr 

imiimi 1 1 1 1 1 1 1 1 1 ( i ( i inn inn imimimiiii mum m 

; 6973 TTTGGGAAGTaAATTTAAAGGAAAAGTTTTCTGCAGAccTAGATCAGTTTCCTTTaGGACGCAAATTT 

iiiiiii ii iiiiiiii mimmmmi mmimm i n miimi 

. 6892 TTTCGGAgGTtAATTTAAAaGAAAAGTT?TCTGCAGAtITAGATCAGTTTCCaCTgGGtCGCAAATTT 
Hill I II IIIIIII I I 1 I t I I I I t I 1 III IIIIIIII I III II II II mm 
I 6 952 TTTGGaAtGTggATTTAAAgGAAAAGrrTTCrttAGAcTTACATCAATaTCCcCrtGGaCGtAAATTT 

TTTGGgAgGttaAtTTAAA-GAAAAgTTTTCtgcaGA-tTaCATCAgTtTCCt-TgGGaCGcAA-TTT 
TTTCGGAGGTTAATTTAAANGAAAAGTTTTCTGCAGANTTACATeA-Jr J 4 6 

C2-GATCAGTTTCCYYTKGGACG-C2 
C3 -GATCAGTWT CCX YTKGGACG-C 3 
C7-CTAGTCAKACGJWAMCCrCC-C7 
0 1 5 -TTTGGAATGTCGAlTTAAAGGAAAAGrrTTCTTTAGACTTAGATCAATATCCCCTTGGACGTAAATTT-0 1 5 
O23-TTTGGGAGGTTAACTTAAAAGAAAAGTTTTCAAGTGAATTAGATCAGTTTCCCCTTGGACGTAAGTTT-023 



7163 TTA 

II 



II 



aCAAAGTGGATATaGgGGACGGtCCT 

mimimi i mm i i 

gCAAAGTGGATATcGAGGACGGaCgT 



I 



I 



I 



6995 TTA TTAcagGcAggtettaaaqcAaaAcctaaacttaaACGtgcAGeccccaCAtCcaCCcgCA 

III II Ml I I II II 

L TTA cTACAAGCAGGATtgAaCGCcaaACCAAAATTTAcAttAGGaAAACCaaaagCTaCACCCacCA 

■'■ mi mini i in ii iiiiiii i in urn i inn 

6960 TTA tTACAGGCAGGATatAGGGCacgtCCtAAATTTAAAgCAGGtAAACG TagTgCACCC t 

i urn mi i in i ii urn i 

3 TT ggTtCAGGCtGGATtgcGtcgcaagCCcAccaTaggccCtcGcAAACG T tctg 

TTa — ta — agcaggattgagggcaaaaccaaaaat«a-4-cacg»a»a-g»tatag-gcaccc-cet 
015-TT GGTTCAGGCTGGATTGCGTCGCAAGCCCACCATACGCCCTCGCAAACG T TCTG-015 
02 3-TTA TT CCAAAGTGGATA7CGAGGACGGACGT-023 



7209 CTatTCGTACAGGTgTtAAGCGCCCtCCTCTtTCcAAagCCTCTgCtCCCCCtAAACCtAAgCCcgCC 

ii imnmi i mum urn 11 u nm i urn inn 11 u 11 

7194 CTgCTCGTACAGGTaTaAAGCGCCCaGCTGTgTCtAAgcCCTCTaCAGCCCCcAAACGaAAaCGTaCC 
II III II II I I I 

7039 CaTCgTCTgCAaaacgcAAaaaggttaaaaAATAAcAcTttCtgtaAttgtgtTAtgtTGTtgtTttg 
I II II I I I II II I I I I Ml 

7108 CcTCATCTACCtCTACAACtGCtAAACG cAAAAAACgTAAGctGTAA GtaTTGTATGTA 

i iiiiiiii nm i ii nm iiiiiii hi mi i iiiiiiii 

7021 CagCATCTACCACTACAcCaGCaAAACGtaAAAAAAC TAAAaaGTAAtgGatgTCTATGTAAtaCaT 

i nm iiiiiii i mi ii i ii 

7075 CtcCATCTgCCACTAC gtcttC TAAA ccTGccAagCgT 

Ct-catcTaC-actacaaa r. — a — t-aat-aa-gtaa-ctg-a-cc-ct-a-c-tgta — tcc- 

015-CTCCATCTGCCACTAC GTCTTC TAAA CCTCCCAAGCGT-0 15 

023-CTGCTCGTACAGGTATAAAGCGCCCACCTGTGTCTAAGCCCTCTACAGCCCCCAAACGAAAACGTACC-023 



SO 



ss 
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6 7277 AAAACtAAAAgGTAATATATGTGT aTaTGTacTGTT 

inn mi nun Mini i in mi 

11 7262 AAAACcAAAAaGTAATATATGTCTgtcagTGTGTTGTGTT 

i i iii minimi 

33 7127 TtcTGtcTAtGTactTtgtgTTCT TGTCTTGTGTTgtTGT 

i ii i ii mi inn in 

16 71S7 TgtTGaaTtaCTCT TGtTTCT TGTGT ATATGT 

i i mi ii mi m iimi 

31 70B8 GTGTctgTatGTCTAtGTGCTTCTgcfcgtatTGT ATATGTCTGTCTttgtgtgtTATATA tg 

iii inn im i nimnim mm 

18 7113 GTG cGTGTAcGTGC caGgaa g t aATATGTGTG TGT gtatataTATATAcat 

--tgtGtagtgt-tatgtgtgtgt tatata 

CAGGAAGTAATATGTGTGTGT GTATATATATATACAT - 0 1 5 
-<023) 

6 7313 AT 
II 

11 7302 ATtTaTATG T TGTTGTA gTGTGT 

i i mi i mini inn 

33 7167 TTGT TtTtTgTGTATG TGttacaaTgtATgTTATGTTGTATGTtacTGTGTTTG 

im i i mm n i n in nmmi mini 

16 7199 TTGT ATGTgc tTGTATG TG CTTGTAaATATTAaGTTGTATGT GTGTTTG 

mini mm mm mi i nun in n 

31 7150 gTATaTGTATGTTTaTGTATG C gTGTGT aCTTGTATATAT GtaTaGTATGT TATGTgTG 

m i inn inn i inn mi im i i mi i 

18 7164 cTATtgtTgTCTTT GTATGtCcTCTCTttgtgtTTGT TgTAT G aTtgcattgTATG G 

con -tatttgtatgtttttgtatg-c-tgtgt-tgt-cttgtatatattatgttgtatgtt-gtgtgtttg 

01 5 -CTATTGTTGTCTTT GTATGTCCTGTGTTTGTGTTTGT TGTAT G ATTGCATTGTATG G-015 

023- ATTTATATG T TGTTGTA GTGTGT ( -02 3 ) 

6 7315 aTATATGT 

mini 

11 7325 aTATGT TtCTTGT atTGTG TATATGT 

inn i inn mi i mn 

33 7221 T ttTATGTgTaCTTGTttGTGTGcATGTTcTATGTacttgt 

i mn mi 

16 724 8 TATGTATG gtaTAATAAA cacGTGTGTATGT 

llllllll lllllll I I II I M f I 

31 7209 TATGTATGctatgtaTGTTAATAAAtatgtgtatacctgtgtgtgtTGTGTATGTTGTcctTataTAc 

immi mi miiimi i ii 

18 7221 TATGTATG gtTGTT gtTGTATGTTGTatgTtacTAt 

— ttatgt-ttcttgtt-gtgtgtatgtt-tatgta — tat 

GTTGTATCTTGTATGTTACTAT-0 1 5 
ATATGT TTCTTGT ATTGTG TATATGT ( -023 ) 
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6 7323 GT gTATGTAcTGT 
II llllll III 

1 7351 GTataTGTTTGTGTATATGT GTAT gtTATGTA TGT 

I IIIIIMIIIIIIII llll Mllll III 

3 7262 cagfcTtccTGTTT.GTGTATATGTtaataaaaeattgTg?GTATtTgt.TaaAc-tATttgTATGTA TGT 
II I I II I I I II II I 

6 727 9 GTtTtTaAaTgcTTgtgtAACTATTGT gTcATgcAacATaAaTAaactt&tT 

llll II lllllllli I II 

1 727 7 AccctaTtagtaacatacTaTtAcTAtTTtataAACTATTGTtccTActTgtTcctAcTtgttCCTgc 

II llll I I I I I III 

8 7257 AtttgtTggtatgtggcaTtaaAtaAaaTatgttttgtggtTetgTgtgTtaTgtggtTgcgcCCTag 

con a tat-tgtttgtgtatat-ataatataagaaactatgttttttatgtaatatt — Tatgtactgt 

O15-ArrTGTTGGTATGTGGCATTAAATAAAATATGrrTTGTGGTTCTGTGTGTTA-rGTGGTTGCGCCCTAG-015 
023- GTATATGTTTGTGTATATGT GTAT GTTATGTA TGT-02 3 

6 7336 TATGT aTATGT GTgTGTGTGTTctGTGTGTaatgtaAgtTATTTGTGtAATGTGTATGTgTGTT 

inn urn ii muni mm i mum iimmm mi 

11 7386 TATGrtgtTATGTatGTtTGTGTGTTtaGTGTGT gtArarATTTCTGGAATGTGTATGTATGTT 

Mill I 11 II I llll II llll 

33 7329 TATGT AtatgggtgtaccTataTGaCTAagGagTTgTATTgcTtGccctacCcTCCATTge 

I I II llll I llll 

16 7331 gtTTCaacAcctACtaaTtgTgtTgTggtTaTtcAtTGTATaTaAactaTatTtCctACATeCtgTtc 

II I I I I I I llllll llll 

31 734 5 TccTCccaAtagtCATgTacTTaTtTctgccTatAaTTTAggTgTcacgccaTaGTaAaAgTtgtaca 
I I II I I I I II I II 

15 7325 TgageaacAactgtATtTgtgTtTgTggtatgggtgTTgcttgtTgggctatataTtgtecTgtattt 

can tatgcaa-aa-gt-»ttttgc-fcttT-tgtgtgtaatgtattt.tatttgt-taa-ttgtatgt-tttt 

015-TGAGTAACAACTGTATTTGTGnTCTCGTATCGCTGTTGCTTGTTGGGCTATATATTGTCCTGTATTT-Ol5 
023-TATGTTGTTATGTATGTTTGTGTGTTTAGTGTGT GTATATATTTGTGGAATGTGTATGTATGTT-023 

6 7400 TaTGTGCAATAAACAATTACctcTtgtTacacCCTCT gACtCAGTGgctgttgcacgcGTT'iTCgT 

i mimmmmi i i urn n urn m n i 

11 7450 TtTGTGCAATAAACAATTA TTatgTgtgtCCTGTTACACcCAGTG actaaGTTgTGtT 

III I III I I llll I II 

33 7390 aaTGTaCcTAccTtfcATTtcccTaTAtrtgtAGtaCCTACATGTttaGTattgCtttacCtTTTGaca 
I II I I I II II II III I I I III 

16 739 9 ttgtTTtaTATaTactaTAtTtTgTAgcgcCACgcCCatTTTGTaGCtTCaAcCgaAttCggTTGcat 

I II II I III II I I I I 

31 7413 CccGgTccgtTtTTtgcaACTaaAgctacTCCATTTTgaTTTtatGCagCCAtTTTAaaTcccTAACC 

ill i ii i mum i i m i mi 

18 7393 CaaCtTataaaacTgcacACcttAcagcaTCCATTTTatccTacaatcctCcaTTTtgcTgtgcAACC 

con tatgttcaa-aatt-attaccttata-t-tcc-tt-t-acat-cagtg-c-atttta— cgttt-act 

015-CAAGTTATAAAACTGCACACCTTACAGCATCCATTTTATCCTACAATCCTCCATTTTGCTGTGCAACC-015 
02 3 -TTTGTGCAATAAACAATTA TTATGTGTGTCCTGTTACACCCAGTG ACTAAGTTGTGTT-02 3 

024 -GAATTCGGTTGCAT 
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6 74 66 TTGCACGCGCCtTacacacataagTaATATAcaTgcAcaATATATATaTtTttqtTtaaaATACTAT 
lllllllllll I I Mil I I 111! ill I I I UN 

IX 7508 TTGCACGCGCCgTtTgtgttgccTTCATAT TatAtTATATATATtTgTaataTacCTATACTATg 
II II I I III I I I II II till 

33 74S8 TacTAgTGtCCaTATtgtacaaTTTCcTccattTTgTATGcCTAaccgTtTtcggTtACTTgCCAtac 

mm "i iii ii mi i ii ii ill 

16 7467 GcTTTtTGgCaCAAaaTgTgttTTTttaAaTAgTTCTATGtCagcaacTaTGgTtTaAacTTGTAcGT 

i iii i ii ii i mi iiiii iii hi n 

31 7481 GtTTTCGGTTCCAttgTtTaaacaTgctAgtAcaaCTATGctgatgcagtaGrreTGcggTTtTTgGT 

i minim nil i i i 

18 7461 GarTTCGGTTGC ctttggcTTaTGtctgTggTttT 

con -ttt-cgg-ccctat-t-ta-A-ttc-tataa-t-ctatgt-tatat-ttt-tt-T-actttgct-tt 

0 1 5-GATTTCCGTTGC CTTTGCCTTATGTCTGTGGTTTT-O 1 5 

02 3-TTGCACGCGCCGTTTGTGTTCCCTTCATAT TATATTATATATATTTGTAATATACCTATACTATG-02 3 
02 4-GCTTTTTGGCACAAAATGTG7TTTTTTAAATAGTTCTATGTCAGCAACTATGGTTTAAAC7TGTACGT-02 4 

< 7533 aCttttatAtTTGCAACCGTTTTCGCTTGCCCTTAgCATACACTTtCCaCcAATTTGTTAcAAC 

I i iiiiiiimimiiiiimiii i m mn n i mi mn m 

11 757 3 tTACCCcccccCAcTTGCAACCGTTTTCGGTTGCCCTTA CATACACTTaCCTCaAATTTGTTAtAAC 
Mill II Mil II III till I Ml 

3 3 7526 aTACCCtaTgaCAtTGGCAGaacAgTTaaTccTTTtCTttCCTGCACTGtgtTtgTcTgTACTtgctg 
II I I II I II I IIMI lllllllll I II 

16 7535 TTCCTG cTfcgCcaTGcgtGccaAaTcecTgtTTTcCTgaCCTGCACTG cTTgccaACcaTtcc 
llllll I II I 

31 7549 TTCCTG aaTAcTagTTTttGCcaacaTTCTggcTtgTagt 
II III III II Ml I I 

18 7496 cTgCacaatacagtacgctggcactattgcaaacttTAaTctTTTggCCactgcTcCTacaTatTttg 

eon tt-c-ct-tt-catt-gcagcc— tttcg-tt-ctcttatc-T-cact— e-tcttct-tattata-c 

01S-CIGCACAATACAGTACCCTGGCACTATTGCAAACTTTAATCTTTTGGGCACTGCTCCTACATATTTTG-O15 

023- TTACCCCCCCCCACTTGCAACCGTTTTCGGTTGCCCTTA CATACACTTACCTCAAATTTGTTATAAC -02 3 

024 - TTCCTG CTTGCCATGCGTGCCAAATCCCTGTTTTCCTCACCTGCACTG CTTGCCAACCATTCC-024 

6 7597 GTGTTTc cTc tTAATCC t ATATat tTTGTG CcAGGTACAcATTGCCCTCCCAAGTtgCTTGCCAA 

mm i mm mi nm i iimn iimmmim minn 

11 7640 GTGTTTtgTACTAATCCcATAT gTTGTGtgcCAAGGTACAtATTGCCCTGCCAAGTatCTTGCCAA 

I III Ml II M II I lllll I 

33 7S94 caTTggcaTACatAcCCtATgacatTgGCagaeCAgTtAATcctTTtCTTTcCTgcacTgtgTTtgtc 

I I II I II I I I II I I I 

16 7S98 aTTgTettTtACACtsCacTatgtgc&ACtActgAaTCAcTaTgTaCATTgtgTCataTAAaaTaaaT 

i i i mi i i i mi i in i i n i 

31 7589 tTCcTgccTaACACacCTTgccaaCATATAAtecAgTCCaacTtTCCAATTAtaCtATgAAtCatgtT 

i ii iii mini i ii mi n i 

18 7564 aaCaattggcgCgCctCTTtggcgCATATAA ggCgcaccTGgtATTA gtcATtttcCtgtcc 

01S-AACAATTGGCGCGCCTCTTTGGCGCATATAA GGCGCACCTCCTATTA CTCATTTTCCTGTCC-0 1 5 

02 3-GTGTTTTGTACTAATCCCATAT G-023 

O24-ATTGTTTTTTACACTGCACTATGTGCAACTACTGAATCACTATGTACA7TGTGTCATATAAAATAAAT-02 4 
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6 7662 gtgcatcatatcctgccaaCcACACACCTCGCgcCAGGGtGecCTATTGC cTtactcATAA 

i milium inn iimiini i mi 

11 770 6 _ CAACACAC CTGGC CAGGCeGCGCTATTGCaTgAcTaaTgTAcaATAA 

II I II 1 I I I I I II I 

33 7662 tgtacTtgctgcAttgacTCAtatataCatCCAGtgcaATtgcaaAaTaCTTaATTgtacTAatAgtT 

I II I II II III I I 

16 7 666 caeTaTgcgcCAACgcctTacatACcgCtgtTAGgcacATatTtTTggeTTgTtrTAactAACcTAAT 

I till II I I II II I II I I III 

31 7657 tGtTtaaaTACAACtgtagttcaACtATgtgTcatgcAcaTATATTataTTaTCCTAcAcAcCTTAAA 

i i nun ii i i urn i in 

18 7626 aGgTgcgeTACAAC AATtgcTtgcatAacTATAT ccaetcCCTA AgtaaTaAAA 

con tg-tatg-tacaacgecate-a-acaactgg-agca-aatt-tata-t-cttt-cta-a — actaaaa 

BE31-XXACGCACAXAXXXX-BE31 £pj£l*+I«*JJ 
015-AGGTGCGCTACAAC AATTGCTTGCATAACTATAT CCACTCCCTA AGTAATAAAA-0 1 5 

024-CACTATGCGCCAACGCCTTACATACCGCTGTTAGGCACATArrrtTGGCTTGTTTTAACTAACCTAAT-024 

6 7723 ACCTCTC TTTGTgttAtAcTtTTaTGcAcTGtAGCCAActcTTAAAAGCATTTTTGGCTTgTAGCa 

mini inn i i i n n mi mm mimiimimm mi 

11 77S3 ACCTGTCGgTTTGT ACAaTgTTgTGgArTGCAGCCAAAggTTAAAAGCATrTTTGGCTTcTAGCt 

I I III III II I I I I Mil llll III lllllllll I 

33 7730 TaCAeATCcTTTtaggcACATAtTTTTaetTTaCtttCAAAccTTAAgtGCAGTTTTGCCTT aCa 

I II II II II I III II llll I I 

16 7734 TgCATATTtGGCAtAaggTTTAaacTIcTAaggCcAaCtAAatgTcAccctAGTTCaTaCaTgaActg 

I II llll I III II I I I II I 

31 7725 CTGCTTTTAGGCACATATTTT C-TagaTTATctaTAtCctTgATTCCAqtgcTCGCTTttqcacAtgt 

mmiminiiiiiiii n 111 i i mm mm 

19 7680 CTGCTTTTAGGCACATArrCTaGTttgTTtTtacTtaagcTaATTGCAtactTGGCTT 

con — c-tttta atataat-tagttte-tattgct--caaa--tTaaa-gcattt-t-gcttgtagc- 

BE3 1-XXAGGCACAXAXXXX-BE3 1 hpvlS+l 8*11 
GE31-XXAGGCACAXAXXXX-BE31 hpvlS+l&*33 
0 1 5 -CTGCTTTTAGGCACATATTTTAGTTTGTTTrTACTTAAGCTAA'rrCCATACTTGGCTT- ( 015 ) 
02 * -TGCATATTTGCCATAAGGTTTAAACTTCTAAG3CCAACTAAATGTCACCCTAGTTCATACATGAACTG-02 4 

6 778 9 GcACATTTTTtTgCtCTTAcTgTtTggTa tACAATAaCa t aAAAATGAGTAACCTAAGGTCACACACC 

I nnmi i i mi lit i mm i nmmmmmmimm 

11 7818 GAACATTTTTGTACcCTTAgTaTaTtaTgcACAATAcCcacAAAATCAGTAACCTAACGTCACACACC 

II llllll II I II I M 

33 779 5 cAAttgcTTTGTAtgCcaAac^aTgceTTGTAAAAgtgagtcActacctgttTaTtAccaGGTGTGga 

ii i i mum i mm 

16 7802 TgtAAagGTTAgtcaTacATtgTTCATITCTAAAA cTgeAeatgCGTGTGtg 

I I I I I III llll II I I 
31 7792 TtaAActGccAaggTTatgTcaTgCATTaTaAATAagttgratgttactcaTATAATtaATtgCatAt 

ii iiii i mm ii i i 

18 7738 gtacaactacTTtcaTgtccaAcatTctgTctacccTtaacatgaacTATAAT AlgaCtaAg 

can -aa-attttt-tact-ttatt-tt-a-tttaaaaaaac-gtaaa-tg — tat — taagga-g — ta — 

01 5 - GTACAACTACTTTC ATGTCCAAC ATTCTGTCTACCCTTAACATGAACrATAAT ATG ACTAAG-0 1 5 
024 -TG7AAAGGTTAGTCATACATTGTTCATTTGTAAAA - CTCCACATGGCTGTCTC-024 
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6 7 857 TGCgACCGGTTTCGGTTAtCCACACCCTACATXTTTCCTTCTTATA 
Hi Ml llllllll III lllllilllllllllllllllllllll 
11 7 886 TGCAACCGGTTTCGGTTACCCACACCCTACATA.TTTCCTTCTTATA 
M III III I III II II I I I I 

33 7863 CtAACCG TTTTaGGTcAtaTTggt.CATTTA tAaTCtTTTATATAATA 

i inn mi n nun i minim - 

16 7854 CaAACCGATTTT gGGTTACACATrrACAAGCAAcTTATATAATAATACTaa 

I II I I II I I I I I I I I I I I I I II I Hill 

31 7 860 agCTattAcaccgtTTTcGGTTACAGtTTTACAAGCAAtTGtTCTTtTTATACT 

ii iii i it i ii iii nun 

13 7 800 ctGTgcatacatagTTTatGcaACcGaaaTAggttgggcaGcaCaTacTATACTtttc 

con cg-aac ttt-ggttatg — acccat-tA-a-ttc-tt-ttataataatact 

015-CTGTGCATACATAGTTTATGCAACCGAAATAGGTTGGGCAGCACATACTATACTTTTC- (015 ) 
024 -CAAACCGATTTT GGGTTACACATTTACAAGCAACTTATATAATAATACTAA ( -02 4 ) 



Claims for the following Contracting States : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, NL, SE 



A composition useful in ICR tor amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising a set of four oligonucleotide probes, said probe sets being selected from the group con- 
sisting of the following oligonucleotide sets: 



ICRS: 



SEQ ID No. 
81 
82 
83 
81 



GCTGCAAACA ACTATACATG ATATAA 

TTATATCATG TATAGTTGTT TGCAGc' 

TATTAGAATG TGTOTACTGC AAGCA 

TGCTTGCAGT ACACACAT TC 7AATA • 



LCR6: SEQ ID No. 

85 
86 



CTTCACTGCA AGACATAGAA ATAA 

TTATTTCTAT GTCTTGCAGT GAA, 

CCTGTGTATA TTGCAAGACA GTAT, 

TACTGTCTTG CAATATACAC AGG ; 



SEO ID No. 
89 



91 
92 



TATATTGCAA GACAGTATTG GAAC, 

GTTCCAATAC TGTCTTGCAA TTTA, 

TTACAGAGGT ATTTGAATTT GCATT, 

AATGCAAATT CAAATACCTC TGTAA , and 



SEQ ID No 
93 



GTATGGAACA ACATTAGAAC AGCA, 

TGCTGTTCTA ATGTTGTTCC AT AC , 

ATACAACAAA CCGTTGTGTG ATTT 

AAATCACACA ACGGTTTGTT GTAT 
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A composition according to claim 1 lor amplifying the DNA of human papilloma virus type 1 6 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets: 

LCR5 (SEQ ID Nos. 81 ,62,63 and 84) and LCR8 (SEQ ID Nos. 93, 94, 95 and 96). 

A composition according to claim 1 for amplifying the DNA of human papilloma virus type 18 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets 

LCR 6(SEQ ID Nos. 85,86,87 and 88) and LCR 7 (SEQ ID Nos. 89,90,91 and 92) 

A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
a composition according to any of claims 1 to 3: and 
further comprising a ligase. 

A kit according to claim 4, wherein said ligase is thermostable. 

A composition useful in PCR for amplifying the DNA of human papilloma virus present in a lest sample, said 
composition comprising: 

a first nucleic acid primer of sense direction, capable of hybridizing to the antisense strand of HPV DNA , said 
primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
consisting of the following sequences 

SEQ IP Nfl, • 

1 CAGATGTCTC TGT GGCGGCC TAGTG. 



6 


GAATTAGTTA 


GACCATTTAA 


AAG, 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


61 


GCTGCAAACA 


ACT ATACATG 


ATATAA, 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA. 


89 


TATATTGCAA 


GACAGTATTG 


GAAC and 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA, and 



a second nucleic acid primer of antisense direction, capable of hybridizing to the sense strand of HPV DNA, 
said primer having from 10 to about 30 nucleotides in length and-having a sequence selected from the group 
consisting of the following sequences: 

SEQ IP No. 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

84 TGCTTGCAGT ACACACATTC TAATA, 
88 TACTGTCTTG CAATATACAC AGG , 
92 AATGCAAATT CAAATACCTC TGTAA and 
96 AAATCACACA ACGGT TTGTT GTAl"; 
provided said first and second primers hybridize to their respective antisense and sense strands at locations 
such that their 3' ends do not overlap and, in the direction of extension, the 5' ends of said primers are spaced 
further apart than the 3' ends of said primers. 

A composition according to claim 6 wherein said first and second primers are selected from the following pairs of 
oligonucleotide sequences ( identified by Sequence ID No ): 
1 and 5, 6 and 5, 7 and 5, 81 and 84, 
85 and 88, 89 and 92, and 93 and 96. 

A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
a composition according to claim 6 or 7 ; and 
further comprising a polymerase. 
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9. A kit according to claim B wherein said polymerase is thermostable 

10. A consensus oligonucleotide for hybridizing human papilloma virus types 6, 11, 16, 18, 31, 33 and 61, which 
oligonucleotide comprises (rom about 10 to about 60 nucleotides in length and is selected from the group of se- 
quences consisting of: 



and their complements. 

11. A type-specific oligonucleotide for determining the presence of human papilloma virus type 1 6, having a sequence 
selected from the group consisting of: 



SFO ID NO. 

81 GCTGCAAACA ACTATACATG ATATAA, 

82 TTATATCATG TATAGTTGTT TGCA6C , 

83 TATTAGAATG T6TGTACTGC AAGCA, 

84 TGCTTGCAGT ACACACATTC TAATA, 

93 GTATGGAACA ACATTAGAAC AGCA. 

94 TGCTGTTCTA ATGT TGTTCC ATAC , 

95 ATACAACAAA CCGTTGTGTG ATTT and 

96 AAATCACACA ACG6TTTGTT GTAT; 



and their complements. 

1 2. A type-specific oligonucleotide for determining the presence of human papilloma virus type 1 8, having a sequence 
selectod from the group consisting of: SEQ ID No 



5E<3 'P Nq. 

85 CTTCACTGCA AGACATAGAA ATAA, 

86 TTATTTCTAT GTCTTGCAGT GAA , 

87 CCTGTGTATA TTGCAAGACA GTAT, 

88 TACTGTCTTG CAATATACAC AGG , 

89 TATATTGCAA GACAGTATT6 GAAC , 

90 GTTCCAATAC TGTCTTGCAA TTTA, 

9t TTACAGAGGT ATT TGAATT T GCATT and 

92 AATGCAAATT CAAATACCTC TGTAA; 



and their complements. 

13. A method for determining the presence of any human papilloma virus in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one consensus oligonucleotide selected from the group of 
claim 10, said oligonucleotide being conjugated to a signal generating compound capable of producing a 
detectable signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated 

14. A method for determining the presence of human papilloma virus type 16 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of claim 11, 
said oligonucleotide being coniugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by delecting the signal generated. 



SFO ID No. 



5 
6 

7 



CAGATGTCTC TGTGGCGGCC TAGTG, 
AGGTGTCAGG aaaaccaaat ttatt, 
GAATTAGTTA GACCATTTAA AAG and 
GGGGAAACAC CAGAATGGAT A; 
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15. A method for determining the presence ot human papillomavirus type 18 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of claim 12, 
said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated 

16. A method according to any of claims 13-15, further comprising a step ol amplification prior to or concurrent with 
said hybridizing step. 

17. A method according to claim 16, wherein said amplification step comprises PCR or LCR. 



Claims for the following Contracting State* : ES 

1. A composition useful in LCR for amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising a set of four oligonucleotide probes, said probe sets being selected from the group con- 
sisting of the following oligonucleotide sets. 



81 GCTGCAAACA 

82 T TATATCAT 0 

83 TATTAGAATG 

84 TGCTTGCAGT 



ACT ATACATG ATATAA, 

TATAGTTGTT TGCAGC , 

TGTGTACTGC AAGCA , 

ACACACATTC TAATA; 



SEQ ID No. 

85 CTTCACTGCA AGACAT AGAA ATAA, 

86 TTATTTCTAT GTCTTGCAGT GAA , 

87 CCTGTGTAT A TTGCAAGACA GTAT , 

88 TACTGTCTTG CAATATACAC AGG, 



ICR 7: SEQ ID No 



TATATTGCAA GACAGTATTG GAAC , 

90 GTTCCAATAC TGTCTTGCAA TTTA, 

9 1 TTACAGAGGT ATTTGAATTT GCATT, 

92 AATGCAAATT CAAATACCTC TGTAA ; and 



LCR8: SEQ ID No. 



GTATGGAACA ACATTAGAAC AGCA , 

TGCTGT7CTA ATGTTGTTCC AT AC , 

ATACAACAAA CCGTTGTGTG ATTT. 

AAATCACACA ACGGTTTGTT GTAT. 



2. A composition according to claim 1 for amplifying the DNA of human papilloma virus type 16 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the fol lowing oligonucleotide sets : 

LCR5 (SEQ ID Nos 81 ,82,83 and 84) and LCR8 (SEQ ID Nos 93. 94. 95 and 96). 

3. A composition according to claim l for amplifying the DNA of human papilloma virus type 18 present in a test 
sample, said composition comprising a set of four oligonucleotide probes, said probe sets being selected from the 
group consisting of the following oligonucleotide sets. 

LCR6(SEQ ID Nos. 85,86,87 and 88) and LCR 7(SEQ ID Nos. 89,90,91 and 92). 

4. A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising: 
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a composition according to any of claims 1 to 3; and further comprising a ligase. 

S. A kit according to claim 4, wherein said ligase is thermostable. 

s 6. A composition useful in PCR for amplifying the DNA of human papilloma virus present in a test sample, said 
composition comprising: 

a first nucleic acid primer of sense direction capable of hybridizing to the antisonse strand of HPV DIMA, said 
primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
'<? consisting of the following sequences: 

SEQ ip Nfl, - 

I CAGATGTCTC TGTGGCGGCC TAGTG, 



6 GAATTAGTTA GACCATTTAA AAG , 

7 GGGGAAACAC CAGAATGGAT A, 

8 I GCTGCAAACA ACT AT ACATG ATATAA, 
85 CTTCACTGCA AGACATAGAA ATAA, 
89 TAT AT TGCAA GACAGT ATTG GAAC and 
93 GTATGGAACA ACATTAGAAC AGCA; and 

a second nucleic acid primer of antisense direction, capable of hybridizing to the sense strand of HPV DNA, 
said primer having from 10 to about 30 nucleotides in length and having a sequence selected from the group 
consisting of the following sequences: 



5 


AGGTGTCAGG 


AAAACCAAAT 


TTATT , 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


88 


TACTGTCTTG 


CAATATACAC 


AGG , 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA and 


96 


AAATCACACA 


ACGGTTTGTT 


GT AT ; 



provided said first and second primers hybridize to their respective antisense and sense strands at locations 
such that their 3' ends do not overlap and, in the direction of extension, the 5' onds of said primers are spaced 
further apart than the 3' ends of said primers. 

7. A composition according to claim 6 wherein said first and second primers are selected from the following pairs of 
oligonucleotide sequences (identified by Sequence ID No ): 

1 and 5 6 and 5, 7 and 5, 81 and 84, 
85 and 88, 89 and 92, and 93 and 96 

8. A kit for detecting the presence of human papilloma virus DNA in a test sample, comprising, 
a composition according to claim 6 or 7, and 

further comprising a polymerase. 

9. A kit according to claim 8 wherein said polymerase is thermostable 

10. A method for determining the presence of any human papilloma virus in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one consensus oligonucleotide selected Irom the group of 
sequences consisting of: 
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SFQ LP "q, 



5 
6 
7 



CA6AT6TCTC TGTGGCGGCC TAGTG, 

AGGTGTCAGG AAAACCAAAT TTATT, 

GAATTAGTTA 6ACCATTTAA AAG and 

GGGGAAACAC CAGAA7 GGAT A; 



and their complements, 

said oligonucleotide being con|ugated to a signal generating compound capable o1 producing a detectable 
signal; and 

b. determining the presence ot human papilloma virus by detecting the signal generated. 

11. .A method for determining the presence ot human papilloma virus type 16 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of sequences 
consisting of: 



SEQ ID. 


^0. 






81 


GCTGCAAACA 


ACTATACATG 


ATATAA , 


82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGTGTACT6C 


AAGCA, 


84 


TGCTTGCAGT 


ACACACA7TC 


TAATA , 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA , 


94 


TGCTGTTCTA 


ATGTTGTTCC 


ATAC. 


95 


ATACAACAAA 


CCGTTGTGTG 


ATT T and 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT ; 



and their complements, said oligonucleotide being conjugated to a signal generating compound capable of 
producing a detectable signal: and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

12. . A method tor determining the presence of human papilloma virus typo 18 in a test sample, comprising: 

a. hybridizing DNA in the test sample with at least one oligonucleotide selected from the group of sequences 
consisting of 

SEQ LB up, 



85 


CTTCACTGCA 


AGACATAGAA 


ATAA , 


86 


TTATT TCTAT 


GTCTTGCAGT 


GAA , 


87 


CCTGTGTATA 


TTGCAAGACA 


GTAT , 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


89 


TATAT TGCAA 


GACAGTATTG 


GAAC , 


90 


GTTCCAATAC 


TGTCT TGCAA 


TTTA, 


91 


TTACAGAGGT 


ATTTGAATTT 


GCATT and 


92 


AATGCAAATT 


CAAATACCTC 


.TGTAA; . 



and their complements, 

said oligonucleotide being conjugated to a signal generating compound capable of producing a detectable 
signal; and 

b. determining the presence of human papilloma virus by detecting the signal generated. 

1 3. A method according to any of claims 10-12, furthor comprising a stop of amplification prior to or concurrent with 
said hybridizing step. 

14. A method according to claim 13, wherein said amplification step comprises PCR or LCR. 
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Patentansprucho 



Patentanspruche fur folgende Vartragastaaten : AT, BE, CH, LI, DE, DK, FR, GB, GR, IT, NL, SE 

1. Zusarnmenselzung, die 1ur die LCR ("ligase chain reaction", Ligasekettenreaktion) zur Verviclfachung der DNA 
des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammensetzung einen 
Satz von vior Oligonukleotidsonden umta3t, wobei die Sondensatze aus der Gruppe gewahlt sind. die aus den 
folgenden Oligonukleotidsatzen bestcht. 



LCRS: SEQ ID N r 



GCTGCAAACA ACTATACATG ATA7AA. 

TTatatcatg tatagttgtt tgcagc. 

TaTTAGAATG TGTG"ACTGC AAGCA, 

TGCTTGCAGT ACACACATTC TAATA, 



ScQ ID N r 
85 
86 



CTTCACTGCA AGACATAGAA ATAA, 

ttatttctat gtcttgcagt GAA. 

CCTGTGTAT A TTGCAAGACA GTA" , 
TACTGTCTTG CAATAiACAC AGG ; 



ScQ 10 N r 
89 



TAtATTGCAA GACAGTA7TG GAAC , 

GTTCCAATAC tgtcttgcaa TTTA. 

ttacagaggt atttgaattt gcatt 

aa-gcaaatt caaatacctc tgtaa 



95 
96 



gtatggaaca acattagaac agca, 

tgctgttcta atgt7gttcc atac , 

A'ACAACAAA CCGTTGTGTG ATTT , 

AAA TCACACA ACGGTTTGTT GTAT. 



2. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 16, der in 
omer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaGt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCH5 (SEQ ID Nrn 81 , 82, 83 und 84) und LCRS (SEQ ID Nrn 93, 94, 95 und 96) 

3. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 18, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaBt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen bosteht: 
LCR6 (SEQ ID Nrn 85, 86, 67 und 88) und LCR7 (SEQ ID Nrn B9, 90, 91 und 92). 

4. Kit zum Nachwois der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes um- 
taBt: 

eine Zusammensetzung nach emem der Anspruche 1 bis 3, und des weiteten eine Ligase. 

5. Kit nach Anspruch 4, worin die Ligase thermostabil ist 

6. Zusammensetzung, die bei der PCR ("polymerase chain reaction" Polymerasekettenreaktion) zur Vervielfachung 
der DNA des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammenset- 
zung folgendes umfa(3t: 
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cinen ersten Nukleinsaureprimer. der zur Richtung gleichlaufig ist. welcher zur Hybridisierung an den gegen- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und cine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den tolgenden Sequenzen besteht: 



I CAGAT6TCTC TGTGGCGGCC TAGTG. 



6 


GAATTAGTTA 


gaccatttaa 


AAG , 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


3 I 


GCTGCAAACA 


actatacatg 


ATATAA , 


3S 


'CTTCACTGCA 


AGACATAGAA 


ATAA , 


39 


TATATTGCAA 


GACAGT ATTG 


GAAC und 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA; und 



einen zwaiten Nukleinsaureprimer, der zur Richtung gegonlaufig ist, welcher zur Hybridisierung an den gleich- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nuklaotide lang ist und eine 
Sequenz aufweist die aus der Gruppe gewahlt ist. die aus den lolgenden Sequenzen besteht: 

SEP ID NE 



5 AGGTGTCAGG AAAACCAAAT TTATT , 

84 TGCTTGCAGT ACACACAT T C TAATA, 

38 TACTGTCTTG CAATATACAC AGG . 

92 AATGCAAATT CAAATACCTC TGTAA und 

96 AAA TCACACA ACGGTTTGTT GTAT; 
vorausgesetzt, da(3 der erste und der zweite Primer an ihre jeweiligen gleich- und gegenlaufigen Strange an 
solchon Stellen hybridisieron, daR ihre 3' Endon nicht uborlappen, und daB dio 5 -Enden der Primer in Ver- 
langerungsnchtung welter raumlich abgesetzt smd als die 3'-Enden der Primer. 

7. Zusammensetzung nach Anspruch 6 worm der ersle und zweite Primer aus den lolgenden Paaren von Oligonu- 
kleotidsoquenzen (die durch die Sequenz ID Nr bezeichnet sind) gewahlt sind: 

1 und 5, 6 und 5, 7 und 5, 81 und 84, 
85 und 88. 89 und 92, und 93 und 96. 

8. Kit zum Nachweis der Anwesanheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendos um- 
faf3t: 

eine Zusammensetzung nach Anspruch 6 oder .7 und des weiteren eine Polymerase 



9. Kit nach Anspruch 8, worin die Polymerase thermostabil ist 

10. Consensus-Oligonukleotid zur Hybridisierung der humanen papillomaviren Typ 6, 11, 16, 16, 31, 33 und61, wobei 
das Oligonukleotid ungefahr 10 bis ungelahr 60 Oligonukleotide lang ist und aus der Gruppo von Sequenzen 
gewahlt ist, die aus folgendem besteht 



CAGAIGTCrc TGTGGCGGCC TAGTG 

AGGTGTCAGG AAAACCAAAT TTATT' 

GAATTAGTTA GACCATTTAA AAG ' 

GGGGAAACAC CAGAATGGAT A und 
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und aus deren Komplementen. 

. Typ-spezifisches Oligonukleotid zur Beslimmung der Anwesenheit des humanen Papillomavirus Typ 16, das eme 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus folgendem besteht: 



8 I GCTGCAAACA actatacatg atataa 

82 TTATATCATG TATAGTTGTT TGCAGc" 

83 TATTAGAATG TGTGTACTGC AAGCA 

84 TGCTTGCAGT ACACACATTC TAATa" 

93 GTATGGAACA ACATTAGAAC AGCA . ' 

94 TGCTGTTCTA ATGTTGTTCC ATAC , 

95 ATACAACAAA CCGTTGTGTG ATIT , , 

96 AAATCACACA ACGGTTTGTT GTAT ; 

und aus deren Komplementen. 

12. Typ-spezifisches Oligonukleotid 2ur Bestimmung der Anwesenheit des humanen Papillomavirus Typ 18, das eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus folgendem besteht: 

fif; Q TD Nr 

85 CTTCACTGCA AGACATAGAA ATAA. 

a 6 TTATTTCTAT GTCTTGCAGT GAA 

S7 CCTGTGTATA TTGCAAGACA GTAT, 

aa TACTGTCTTG CAATATACAC AGG 

89 TAT AT TGCAA GACAGTATT.G GAAC. 

90 GTTCCAATAC TGTCTTGCAA TTTA 

g, TTACAGAGGT ATTTGAATTT GCATT und 

92 AATGCAAATT CAAATACCTC TGTAA; 
und aus deren Komplementen 

13. Verfahren zur Bestimmung der Anwesenheit irgendeines humanen Papillomavirus in einer Testprobe, das folgen- 
des umfaGt: 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Consensus-Oligonukleotid, das aus der Grup- 
pe nach Anspruch 10 gewahlt ist, wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiert 
ist, die zur Erzeugung gines nachweisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgowiesen wird 

s Typ 16 in einer Probe, das folgendes 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe nach 
Anspruch 11 gewahlt ist, wobei das Oligonukleotid an eine signalerzougcnde Verbindung konjugiert ist, die 
zur Erzeugung eines nachweisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird 

;r Testprobe, das folgendes 

a. Hybridisieren der DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus dor Gruppe nach 
Anspruch 12 gewahlt ist, wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiort ist, die 
zur Erzeugung eines nachweisbaren Signals befahigt ist, und 
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b. Beslimrnon der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

16. Verfahren nach einam der Anspruche 13-15, das des weiteron einen Vervieltachungsschntt umfaflt, der vor Oder 
in Konkurrenz mit dem Hybridisierungsschritt stattfindet. 

17. Verfahren nach Anspruch 16, worin der Vervielfachungsschritt PCB Oder LCR umfafJl. 



Patenlanspruchefur folgenden Vertragastaat : ES 

1. Zusammensetzung, die fur die LCfl ("ligase chain reaction", Ligasekeltenreaktion) zur Vervielfachung der DNA 
des humanen Papillomavirus nutzlich ist, der in einer Testprobe vorhanden ist, wobei die Zusammensetzung einen 
Satz von vier Oligonukleotidsonden umfafit, wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den 
folgenden Oligonukleotidsatzen besteht 



LCR5: 5EQ !0 N f 



GC7GCAAACA ACTATACATG A7ATAA. 

TTA7A7CA7G TATAQrTGiT TGCA3C. 

TA77AGAA7G TGiGTACTGC AAGCA, 

TGC77GCAG T ACACACAmC TAAlA; 



LCRS: ScQiOMr 



CTTCACTGCA AGACATAGAA ATAA, 

ttatttctat G'CTTGCAGT GAA, 

CC7GTGTATA TTGCAAGACA GTAT . 

TACTG7C7TG CAA7A7ACAC AGO; 



ScO lONt 

39 7A7A77GCAA GACAGTA77G GAAC , 

90 GT7CCAATAC TG7CTTGCAA T77A, 

9 I 77ACAGAGG7 A777GAA777 GCA77 , 

92 AA7GCAAA7 7 CAAA7ACC7C 7G7AA. 



ICRS: StSlONr 

93 G7A7GGAACA 

94 7GC7GT7CTA 

95 A7ACAACAAA 

96 AAA 7CACACA 



ACA77AGAAC AGCA, 

A7GT7G77CC A7AC , 

CCG77G7GTG A7TT, 

ACGG777G77 G7A7. 



2. Zusammensetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus Typ 16, der in 
einer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaflt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCH5 (SEQ ID Nrn 81 , 82, 83 und 84) und LCR8 (SEQ ID Nrn 93, 94, 95 und 96). 

3. Zusamrnonsetzung nach Anspruch 1 zur Vervielfachung der DNA des humanen Papillomavirus TYP 18, der in 
omer Testprobe vorhanden ist, wobei die Zusammensetzung einen Satz von vier Oligonukleotidsonden umfaBt, 
wobei die Sondensatze aus der Gruppe gewahlt sind, die aus den folgenden Oligonukleotidsatzen besteht: 
LCP.6 (SEQ ID Nrn 85, 86, 87 und 08) und LCR7 (SEQ ID Nrn 89, 90, 91 und 92). 

4. Kit zum Nachweis der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendcs um- 
ta(3t: 

eine Zusammensetzung nach einem der Anspruche 1 bis 3, und des weiteren eine Ligaso. 

5. Kit nach Anspruch 4, worin die Ligase thermostabil ist. 
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6. Zusammensetzung, die bei dor PCR ("polymerase chain reaction" polymerasekeltenreaktion) zur Vervielfachung 
der DNAdes humanen Papillomavirus nutzlich ist, der in einor Teslprobe vorhanden ist, wobei die Zusammenset- 
zung lolgendes umfaRt: 

einen ersten Nukleinsaureprimer, der zur Richtung gleichlaufig ist, welcher zur Hybridisierung an den gegen- 
laufigen Strang der HPV-DNA befahigt ist. wobei der Primer 10 bis ungefahr 30 Nukleotide lang ist und eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 

SEP ID Nr 



1 CAGATGTCTC TGT GGCGGCC TAGTG , 



6 GAATTAGTTA GACCATTTAA AAG , 

7 GGGGAAACAC CAGAATGGAT A, 

SI OCTGCAAACA ACTATACATG ATaTAA. 

35 CTTCACTGCA AGACATAGAA ATAA. 

69 TATATTGCAA GACAGTATTG GAAC und 

93 GTa TGGAACA ACATTAGAAC AGCA; und 

einen zweiten Nukleinsaureprimer, der zur Richtung gegenlaufig ist, welcher zur Hybridisierung an den gleich- 
laufigen Strang der HPV-DNA befahigt ist, wobei der Primer 10 bis ungefahr 30 Nukleou'de lang ist und eine 
Sequenz aufweist, die aus der Gruppe gewahlt ist, die aus den folgenden Sequenzen besteht: 

SEP ID Nr 



5 


AGGTGTCAGG 


aaaaccaaat 


TTATT, 


84 


TGCTTGCAGT 


acacacattc 


TAATA , 


aa 


TACTGTCTTG 


CAA TATACAC 


AGG, 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA 


96 


aaatcacaca 


ACGGTTTGTT 


GTAT; 



vorausgesotzt, da!i der erste und der zweite Primer an ihre jeweiligen gleich- und gegenlaufigen Strange an 
solchen Stellen hybridisioren, daR ihre 3'-Enden nicht uberlappen, und da3 die 5'-Enden dor Primer in Ver- 
langerungsrichtung weiter raumlich abgesetzt sind als die 3'-Enden der Primer. 

7. Zusammensetzung nach Anspruch 6, worin dor erste und zweite Primer aus den lolgenden Paaren von Oligonu- 
kleotidsequenzen (die durch die Sequenz ID Nr bezeichnet sind) gewahlt sind: 

1 und 5, 6 und 5, 7 und 5, 81 und 84, 
85 und 88, 89 und 92, und 93 und 96. 

8. Kit zum Nachweis der Anwesenheit der DNA des humanen Papillomavirus in einer Testprobe, das folgendes um- 
faGt: 

eine Zusammensetzung nach Anspruch 6 Oder 7, und des weiteren oine Polymerase. 

9. Kit nach Anspruch 8, worin die Polymerase thermostabil ist. 

10. Voilahren zur Bestimmung der Anwesenheit irgendeines humanen papillomavirus in einer Testprobe, das folgen- 
des umlaflt: 

a. Hybridisieren der DNA in der Testprobe mit wonigstens einem Consensus-Oligonukleotid, das aus der Grup- 
pe von Sequenzen gewahlt ist, die aus folgendem besteht: 
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SEP IP Nr 



5 
6 

7 



CAGATGTCTC tgtggcggcc tagtg, 

aggtgtcagg aaaaccaaat TTATT 

GAATTAGTTa GACCATTTAA AAG and 

GGGGAAACAC CAGAATGGAT A; 



und aus deren Komplamenten, 

wobai das Oligonukleotid an eine signalerzeugende Verbindung konjugiert ist, die zur Erzeugung eines nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

11. Verfahren zur Bastimmung der Anwesenheit des humanen Papillomavirus Typ 16 in einer Probe, das folgendes 
umfaUt: 

a. Hybridisieren der DNA in der Testprobe mil wenigstens einem Oligonukleotid, das aus der Gruppe von 
Sequenzon gewahtt ist, die aus folgendem besteht: 



82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGT GTACTGC 


AAGCA , 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA. 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA, 


94 


TGCTGTTCTA 


ATGTTGTTCC 


ATAC, 


95 


AT ACAACAAA 


CCGTTGTGTG 


ATTT und 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



und aus deren Komplementen, 

wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugiert ist, die zur Erzeugung eines nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird. 

1 2. Varfahren zur Bastimmung dor Anwesenheit des humanen Papillomavirus Typ 1 8 in einer Testprobe, das folgendes 
umfaGt 

a, Hybridisieren dor DNA in der Testprobe mit wenigstens einem Oligonukleotid, das aus der Gruppe von 
Sequenzen gewahlt ist, die aus folgendem besteht 



SEP ID Nr 



GCTGCAAACA 



ACTATACATG ATATAA, 
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S EP id Mr 

85 CTTCACTGCA 

86 TTATTTCT AT 

87 CCTGTGTATA 

88 TACTGTCTTG 

89 TATATTGCAA 

90 GTTCCAATAC 
9 1 TTACAGAGGT 
92 AATGCAAATT 



AGACATAGAA ATAA. 

GTCTTGCAGT GAA , 

TTGCAAGACA GTAT, 

CAATATACAC AGG , 

GACAGTATTG GAAC. 

TGTCTTGCAA TTTA, 

ATT TGAATTT GCATT u ^d 

CAAATACCTC TGTAA; 



und aus deren Komplementen, 

wobei das Oligonukleotid an eine signalerzeugende Verbindung konjugierl ist, die zur Erzeugung einos nach- 
weisbaren Signals befahigt ist, und 

b. Bestimmen der Anwesenheit des humanen Papillomavirus, indem das erzeugte Signal nachgewiesen wird 

13. Verfahren nach einem der Anspruche 10-12, das das weitofen einen Vervielfachungsschritt umlafM, der vor oder 
in Konkurrenz mit dem Hybridisierungsschritt statlfindet 

14. Verfahren nach Anspruch 13, worin der vervielfachungsschritt PCR Oder LCR umfa3t. 



Revendlcationa 



Revondications pour les EtaU contractanta aulvanU : AT, BE, CH, LI, OE, DK, FR, GB, GR, IT, NL, SE 

1 . Composition utile dans la LCH pour amplifier I'ADN de virus du papillome humain present dans echantillon a dosor, 
ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, lesdits ensembles de sondes 
otant s^lectionnes dans le groupe constitu.3 par les ensembles (^'oligonucleotides suivants : 



LCR5 : n* d'identification 

81 GCTGCAAACA ACTATACATG ATATAA, 

82 TTATATCATG TATAGTTGTT TGCAGC, 

83 TATTAGAATG TGTGTACTGC AAGCA, 

84 TGCTTGCAGT ACACACATTC TAATA ; 



n* d'identification 

85 CITCACTGCA AGACATAGAA ATAA, 

86 TTATTTCTAT GTCTTGCAGT GAA, 

87 CCTGTGTATA TTGCAAGACA GTAT, 

88 TACTGTCTTG CAATATACAC AGG ; 
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TATATTGCAA GACAGTATTG GAAC 
GTTCCAATAC TGTCTTGCAA TTTA, 
TTACAGAGGT ATTTGAATTT GCATT, 
AATGCAAATT CAAATACCTC TGTAA ; ct 

LCR8 : n* d'idcntification 

93 GTATGGAACA ACATTAGAAC AGCA, 

94 TGCTGTTCTA ATGTTGTTCC ATAC, 

95 ATACAACAAA CCGTTGTGTG ATTT, 
% AAATCACACA ACGGTTTGTT GTAT. 



LCR7 : n* d'idcntification 



2. Composition salon la ravendication 1, destines a amplifier I'ADN de virus du papilloma humain da type 16 present 
dans un echantillon a doser, ladite composition comprenanl un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles d'oligonucleotides 
suivants 

ss LCR5 (n° d'identification 81 , 82, 83 at 84) et LCR8 (n« d'identification 93, 94, 95 et 96) 

3. Composition salon la ravendication 1 , destinee a amplifier I'ADN de virus du papilloma humain de type 18 present 
dans un echantillon a doser, ladite composition comprenant un ensemble de quatre sondes olicjonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles d'oligonucleotides 

30 suivants : 

LCR6 (n° d'identification 85, 86, 87 et 88) et LCR7 (n" d'identification 89. 90, 91 et 92), 

4. Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser. comprenant 
une composition selon I'une quelconque des revendications 1 a 3, et en outre une ligase 

35 

5. Kit selon la revendication 4, dans lequel ladite ligase est thermostable. 

6. Composition utile dans la PCR pour amplifier I'ADN de virus du papillome humain present dans un echantillon a 
doser, ladite composition comprenant : 

une premiere amorce d'acide nucleique de direction sens, capable de s'hybrider au brin antisens de I'ADN de 
HPV, ladite amorce ayant de 1 0 a environ 30 nucleotides de long et une sequence selectionnee dans le groupe 
constitue par los sequences suivantes : 

is N* d'idcntification 



1 


CAGATGTCTC 


TGTGGCGGCC 


TAGTG, 


6 


GAATTAGTTA 


GACCATTTAA 


AAG, 


7 


GGGGAAACAC 


CAGAATGGAT 


A, 


81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 


89 


TATATTGCAA 


GACAGTATTG 


GAACct 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA ; ct 



une deuxieme amorce d'acide nucleique de direction antisens, capable de s'hybrider au brin sens de I'ADN 
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de HPV, ladite amorce ayant de 10 a environ 30 nucleotides de long et une sequence selectionnee dans le 
groupe constitue par los sdquences suivantes : 

N* d'identification 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

84 TGCTTGCAGT ACACACATTC TAATA, 

88 TACTGTCTTG CAATATACAC AGG, 

92 AATGCAAATT CAAATACCTC TGTAA ct 

96 AAATCACACA ACGGTTTGTT GTAT ; 

pour autant que lesdites premiere et dauxieme amorces s'hybrident a leurs brins respectifs antisens et sens 
ades emplacements telsque leurs extremites 3' ne se chevauchent pas et que, dans la direction d'extension, 
les extremites 5' desdites amorces soient plus espacees qje les extremites 3' dosdites amorces. 

7. Composition selon la revendication 6, dans laquelle lesdites premiere et deuxieme amorces sont selectionnees 
parmi les paires suivantes de sequences oligonucleotidiques (identifies par leur numero d'identification) : 

1 et 5, 6 et 5, 7 ot 5, 81 et 84, 
85 et 88, 89 et 92, et 93 et 96. 

8. Kit de detection de la presence d'ADN de virus du papillome hurnain dans un echantillon a doser, comprenant 

une composition selon la revendication 6 ou 7 et en outre une polymerase. 

9. Kit selon la revendication 8, dans lequel ladite polymerase est thermostable 

1 0. Oligonucleotide consensus pour hybridation du virus du papillome hurnain des types 6, 1 1 , 1 S, 1 8, 31 , 33 et 61 , 
lequel oligonucleotide a d'environ 10 a environ 60 nucleotides de long et est selectionne dans le groupe de se- 
quences constitue par : 

N* ^identification 

1 CAGATGTCTC TGTGGCGGCC TAGTG, 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

6 GAATTAGTTA GACCATTTAA AAGet 

7 GGGGAAACAC CAGAATGGAT A; 

et leurs complements 

11. Oligonucleotide specifique d'un type, destine a determiner la presence du virus du papillome hurnain de type 16, 
ayant une sequence selectionnee dans le groupe constitue par 

N" d'identification 



81 


GCTGCAAACA 


ACTATACATG 


ATATAA, 


82 


TTATATCATG 


TATAGTTGTT 


TGCAGC, 


83 


TATTAGAATG 


TGTGTACTGC 


AAGCA, 


84 


TGCTTGCAGT 


ACACACATTC 


TAATA, 


93 


GTATGGAACA 


ACATTAGAAC 


AGCA 


94 


TGCTGTTCTA 


ATGTTGTTCC 


ATAC, 


95 


ATACAACAAA 


CCGTTGTGTG 


ATTTet 


96 


AAATCACACA 


ACGGTTTGTT 


GTAT; 



et leurs complements 

1 2. Oligonucleotide specifique d'un type, destine a determiner la presence du virus du papillome hurnain de type 1 8. 
ayant une sequence selectionnee dans le groupe constitue par : 
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NT ^identification 



85 


CTTCACTGCA 


AGACATAGAA 


ATAA, 


86 


TTATTTCTAT 


GTCTTGCAGT 


GAA, 


87 


CCTGTGTATA 


TTGCAAGACA 


GTAT, 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


89 


TATATTGCAA 


GACAGTATTG 


GAAC, 


90 


GTTCCAATAC 


TGTCTTGCAA 


TTTA, 


91 


TTACAGAGGT ATTTGAATTT 


GCATTct 


92 


AATGCAAATT CAAATACCTC 


TGTAA ; 



13. Procede de determination de la presence d'un virus quelconque du papillome humain dans un echantillon a doser, 
comprenant . 

a I'hybridation de I'ADN dans I'echantillan a doser avec au moins un oligonucleotide consensus selectionne 
dans le groupe selon la revendication 10, ledit oligonucleotide etant conjugue a un compose emetteur d'un 
signal, capable de produire un signal detectable, et 

b. la determination de la presence du virus du papillome humain par detection du signal emis 

14. Procede de determination de la presence du virus du papillome humain de type 16 dans un echantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide selectionne dans le 
groupe selon la revendication 11, ledrt oligonucleotide etant conjugue a un compose emetteur d'un signal, 
capable de produire un signal detectable, et 

b. la determination de la presenco du virus du papillome humain par detection du signal emis. 

15. Procede de determination de la presence du virus du papillome humain de type 1B dans un echantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide selectionne dans le 
groupe selon la revendication 12, ledit oligonucleotide etant conjugue a un compose emetteur d'un signal, 
capable de produire un signal detectable, et 

b. la determination de la presonce du virus du papillome humain par detection du signal emis. 

16. Precede selon une quelconque des revendications 13 a 15, comprenant en outre une etape d'amplitication avant 
ou pendant ladite etape d'hybridation. 

17. Procede selon la revendication 16, dans lequel ladite etape d'amplitication comprend la PCR ou la LCR. 



Hevendicatlon* pour I'Etat contractant suivant : ES 

1 . Composition utile dans la LCR pour amplifier I' AON de virus du papillomo humain present dans echantillon a doser, 
ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, lesdits ensembles de sondes 
etant seleclionnes dans le groupe constitue par les onsembles d'oligonucleotides suivants : 

LCR5 : n* ^identification 

81 GCTGCAAACA ACTATACATG ATATAA, 

82 TTATATCATG TATAGTTGTT TGCAGC, 

83 TATTAGAATG TGTGTACTGC AAGCA, 

84 TGCTTGCAGT ACACACATTC TAATA ; 
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LCR6 : n* ^identification 

85 CITCACTGCA AGACATAGAA ATAA, 

5 86 TTATTTCTAT GTCTTGCAGT GAA, 

87 CCTGTGTATA TTGCAAGACA GTAT, 

88 TACTGTCTTG CAATATACAC AGG ; 

10 

LCR7 : n* d'identification 

89 TATATTGCAA GACAGTATTG GAAC 

15 90 GTTCCAATAC TGTCTTGCAA TTTA, 

91 TTACAGAGGT ATTTGAATTT GCATT, 

92 AATGCAAATT CAAATACCTC TGTAA ; ct 



LCR8 : n* d'idcntification 

93 GTATGGAACA 

94 TGCTGTTCTA 

95 ATACAACAAA 

96 AAATCACACA 



ACATTAGAAC AGCA, 
ATGTTGTTCC ATAC, 
CCGTTGTGTG ATTT, 
ACGGTTTGTT GTAT. 



2. Composition solon la rovondication 1 , destineo a amplifior I'ADN do virus du papillomo humain do lypo 16 prosont 
dans un echantillon a doser, ladite composilion comprenant un ensemble de qualre sondes oligon jcleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles d'oligonucleotides 
suivants : 

35 LCR5 (n° d'identification 31 , 82. 83 et 84) et LCR8 (n» ^identification 93, 94. 95 el 96). 

3. Composition selon la revendication 1 destinee a amplifier I'ADN de virus du papillome humain de type 1 8 present 
dans un echantillon a doser, ladite composition comprenant un ensemble de quatre sondes oligonucleotidiques, 
lesdits ensembles de sondes etant selectionnes dans le groupe constitue par les ensembles d'oligonucleotides 

LCR6 (n° d'identification 85, 86, 87 et 88) et LCR7 (n» d'identification 89, 90. 91 et 92). 

4. Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser, comprenant : 

une composition selon I'une quelconque des revendications 1 a 3, et en outre une ligase 

45 

5. Kit selon la revendication 4, dans lequel ladite ligase est thermostable. 

6. Composition utile dans la PCR pour amplifier I'ADN de virus du papillome humain present dans un echantillon a 
doser, ladite composition comprenant : 

50 

une premiere amorce d'acide nucleique de direction sens, capable de s'hybrider au brin antisens de I'ADN de 
HPV, ladite amorce ayant de 1 0 a environ 30 nucleotides de long et une sequence selectionnee dans le groupe 
constitue par les sequences suivantes : 
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N* ^identification 



CAGATGTCTC TGTGGCGGCC TAGTG, 

6 GAATTAGTTA GACCATTTAA AAG, 

7 GGGGAAACAC CAGAATGGAT A, 

81 GCTGCAAACA ACTATACATG ATATAA, 

85 CTTCACTGCA AGACATAGAA ATAA, 

89 TATATTGCAA GACAGTATTG GAACct 

93 GTATGGAACA ACATTAGAAC AGCA ; et 



une deuxieme amorce d'acide nucleique de direction antisens, capable de s'hybrider au bnn sens de I'ADN 
de HPV, ladite amorce ayant de 10 a environ 30 nucleotides da long et une sequence selectionnee dans le 
groupe constitue par les sequences suivantes : 

N* ^identification 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

84 TGCTTGCAGT ACACACATTC TAATA, 

88 TACTGTCTTG CAATATACAC AGG, 

92 AATGCAAATT CAAATACCTC TGTAAet 

96 AAATCACACA ACGGTTTGTT GTAT; 

pour autant que lesdites premiere et deuxieme amorces s'hybndent a leurs brins respertifs antisens el sens 
a des emplacements tels que leurs extremitas 3' ne se chevauchent pas et que, dans la direction d extension, 
les extremities 5' desdites amorcos soient plus espacees que les extremitas 3' desdries amorces 

7. Composition selon la revendication 6, dans laquelle lesdites premiere et deuxieme amorces sont selectionnees 
parmi les paires suivantes de sequences oligonucleotidiques (identifies par leur numero d'identification) . 

1 et 5, 6etS, 7 et 5, 81 et 84, 
85 et 88, 89 et 92, et 93 et 96. 

8. Kit de detection de la presence d'ADN de virus du papillome humain dans un echantillon a doser, comprenant : 

une composition selon la revendication 6 ou 7 et 
en outre une polymerase 

9. Kit selon la revendication 8, dans lequel ladite polymerase est thermostable, 

10. Precede de determination de la presence d'un virus quelconque du papillome humain dans un echantillon a doser 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moins un oligonucleotide consensus selectionne 
dans le groupe de sequences constitue par : 

N* d'identification 

1 CAGATGTCTC TGTGGCGGCC TAGTG, 

5 AGGTGTCAGG AAAACCAAAT TTATT, 

6 GAATTAGTTA GACCATTTAA AAG et 

7 GGGGAAACAC CAGAATGGAT A; 

at leurs complements, ledit oligonucleotide etant conjugue a un compose emetteur d un signal, capable de 
produire un signal detectable, et 

b, la determination de la presence du virus du papillome humain par detection du signal omis. 



11. Procede de determination de la presence du virus du papillome humain do type 16 dans un dchantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a doser avec au moms un oligonucleotide selectionne dans le 
groups do sequences constitue par : 

N* ({'identification 

81 GCTGCAAACA ACTATACATG ATATAA, 

82 TTATATCATG TATAGTTGTT TGCAGC, 

83 TATTAGAATG TGTGTACTGC AAGCA, 

84 TGCTTGCAGT ACACACATTC TAATA, 

93 GTATGGAACA ACATTAGAAC AGCA, 

94 TGCTGTTCTA ATGTTGTTCC ATAC, 

95 ATACAACAAA CCGTTGTGTG ATTTct 

96 AAATCACACA ACGGTTTGTT GTAT ; 
et leurs complements, 

ledit oligonucleotide etant conjugue a un compose emetteur d'un signal, capable de produire un signal detec- 
table, et 

b. la determination de la presence du virus du papilloma humain par d6tection du signal emis. 

12. Procede de determination de la presonce du virus du papillome humain de type 1B dans un cchantillon a doser, 
comprenant : 

a. I'hybridation de I'ADN dans I'echantillon a dosor avoc au moins un oligonucleotide selectionne dans le 
groups de sequences constitue par . 

N* d'identification 
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CTTCACTGCA 


AGACATAGAA 


ATAA, 


86 


TTATTTCTAT 


GTCTTGCAGT 


GAA, 


87 


CCTGTGTATA 


TTGCAAGACA 


GTAT, 


88 


TACTGTCTTG 


CAATATACAC 


AGG, 


89 


TATATTGCAA 


GACAGTATTG 


GAAC, 


90 


GTTCCAATAC 


TGTCTTGCAA 


TTTA, 


91 


TTACAGAGGT 


ATTTGAATTT 


GCATTet 


92 


AATGCAAATT 


CAAATACCTC 


TGTAA ; 



at leurs complements, 

ledit oligonucleotide etant conjugue a un compose emetteur d'un signal, capable de produire un signal detec- 
table, et 

b. la determination de la presence du virus du papillome humain par detection du signal emis. 

13. Procede selon une quelconque des revendications 10 a 12, comprenant en outre une etape d'amplification avant 
ou pendant ladite etape d'hybridation 

14. Procede selon la revendication 13, dans lequel ladite etape d'amplification comprend la PCR ou la LCR. 
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